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Exploring the Selection of Galaxy Clusters and Groups: 
An Optical Survey for X-ray Dark Clusters 



m 
o 
o 

-i— > 
O 



oo 

(N 



> 
OS 

o 

00 

O 

m 
o 

Oh 
6 



13 



David G. Gilbank 1 * f, Richard G. Bower 1 J, F. J. Castander 2 and B. L. Ziegler 3 ^ 

1 Department of Physics, University of Durham, South Road, Durham, DH1 3LE, UK 
2 Institut d'Estudis Espacials de Catalunya/CSIC Gran Capita 2-4, 08034, Barcelona, Spain. 
3 Sternwarte, Geismarlandstr. 11, 37083 Gottingen, Germany 



2 February 2008 



ABSTRACT 

Data from a new, wide field, coincident optical and X-ray survey, the X-ray Dark Cluster 
Survey (XDCS) are presented. This survey comprises simultaneous and independent searches 
for clusters of galaxies in the optical and X-ray passbands. Optical cluster detection algo- 
rithms implemented on the data are detailed. Two distinct optically selected catalogues are 
constructed, one based on I-band overdensity, the other on overdensities of colour-selected 
galaxies. The superior accuracy of the colour-selection technique over that of the single pass- 
band method is demonstrated, via internal consistency checks and comparison with external 
spectroscopic redshift information. This is compared with an X-ray selected cluster catalogue. 
In terms of gross numbers, the survey yields 185 I-band selected, 290 colour selected and 15 
X-ray selected systems, residing in ~1 ldeg 2 of optical + X-ray imaging. 

The relationship between optical richness/ luminosity and X-ray luminosity is examined, 
by measuring X-ray luminosities at the positions of our 290 colour-selected systems. Power 
law correlations between the optical richness/ luminosity versus X-ray luminosity are fitted, 
both exhibiting approximately 0.2 dex of intrinsic scatter. Interesting outliers in these cor- 
relations are discussed in greater detail. Spectroscopic follow up of a subsample of X-ray 
underluminous systems confirms their reality. 

Key words: Galaxies: clusters: general, X-rays: galaxies: clusters, Surveys, Cosmology: mis- 
cellaneous 



1 INTRODUCTION 

Clusters of galaxies are extremely important astrophysical tools. 
They are the most massive virialised objects in the Universe. Since 
clusters form from extremely high peaks in the initial density field 
on scales of around 10 h~ 1 Mpc, they are sensitive to the ampli- 
tude of the power spectrum on these scales. Thus, observations of 
the cluster mass function out to large redshifts can place tight con- 
strain ts on cosmological parameters (e.g. Q m , as, A; lEke et all 
1996). They are also powerful laboratories for studying galaxy for- 
mation and evolution. Several different techniques exist for finding 
clusters, each relying on different properties of clusters in order to 
locate them, and it is important to try to understand how the selec- 
tion method may bias the sample and affect the scientific results. 

The first attempt at a large, h omoge neous survey for galaxy 
clusters was conducted by Abell| ll958l) . This was a phenome- 
nal effort by one individual to identify overdensities of galaxies 
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by visual inspection of Palomar Observatory Sky Survey (POSS) 
photographic plates, yielding nearly 1700 clusters in his "homo- 
geneous statistical sample" and over 2700 in his full sample. 
Similar catalogu es we r e constructed by Zwicky and collaborators 
IZwickv & et alll968h . lAheiiNl958l) 's Northern catalogue was ex- 
tended to the Southern hemisphere by Abell et al. 1 1989), applying 
his same statistical criteria. 

With the advent of space-based X-ray telescopes, such as 
UHURU, a new way to discover galaxy clusters was found. Spa- 
tially extended, thermal X-ray emission was detected and shown to 
be due to the hot intracluster medium (ICM) - the plasma trapped 
in a c luster's potential well I Mit chell et alll976llSerlemitsos et ail 
1 19771) . This provided a way to show that the cluster was a gen- 
uine physically bound system. Furthermore, the background signal 
(produced by X-ray point sources) is lower in the X-ray sky than 
the background in the optical, produced by a much greater surface 
density of foreground and background galaxies. Optical selection 
techniques lost favour: their main disadvantage being that there was 
no way, at the selection stage, to distinguish between genuine clus- 
ters and chance projections of less massive galaxy groups along 
the lines of sight. Extensive discussions of this contamination have 
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been published (e.g. lKatgert et all 19961 : Ivan Haarlem et alll997l) . 
To reject such spurious systems, observationally expensive spec- 
troscopy is required to confirm the overdensities in 3D. Despite 
the revolutionary new X-ray techniques, four large optical pho- 
tographic cluster surveys with follow up spectroscopy were un- 
dertaken in the late 1980s iGunn et alJll986t Icouch et aljfl99lt 
Madd ox"et all 19 88 ; Lumsd en et alj 19921) . The first two used visual 
inspection of photographic plates, and the second two utilised ma- 
chines which automatically measured parameters of objects from 
photographic plates. The catalogues derived from plate scanning 
could be passed to computerised overdensity detection algorithms, 
and for the first time cluster detection advanced beyond subjective 
visual inspection. 

Prior to the construction of large X-ray selected samples of 
clusters, it was natural to target the optically selected clusters de- 
scribed above with X-ray telescopes in an attempt to measure the 
X-ray luminosity function (XLF) at high redshift. ICastander et alJ 
( 1994) used ROSAT to observe cluster candidates in the re dshift 
range 0.7-0.9 from a 3.5 square degree subsample of IGunn et alJ 
( 1986)'s optical cluster catalogue a nd also found surprisingly weak 
X-ray emission (~10 43 erg s -1 ). iBower et alJ 1 19941) undertook 
ROSA T X-ray obse rvations of optically sel ected clusters from the 
ICouch et all Il99ll) catalogue <Couch et ai1ll99ll visually selected 
clusters based on the density enhancement of galaxies above the 
mean background, but tested their method exhaustively against 
simulated data). From this 46 deg 2 catalogue. IBower et alJ 1 1994) 
took clusters with reliable spectroscopic follow up and X-ray data 
in the redshift range 0. 15 to 0.66, assuming this to be a random sub- 
sample of the full catalogue. The total sky coverage of this survey 
was 26.8 deg 2 and contained 14 clusters. The X-ray luminosities of 
all but two of the clusters was found to be surprisingly weak - less 
than 5 x 10 43 erg s _1 . This decrease with respect to the locally mea- 
sured value was attributed to evolution in the XLF between z=0 and 
«0.4. The alternative is that if the XLF does not evolve between 
these redshifts, then the missing X-ray luminous clusters must be 
made up of optically poorer systems, missing from this sample. 
This raises the question do optical and X-ray surveys sample the 
same clusters? 

With the advent of high quantum-efficiency, large format 
charged-coupled devices (CCDs) in the early 1990s, optical clus- 
ter studies are again becoming attractive. The first serious attempt 
at an automated optical CCD survey with a quantifiable selec- 
tion function w as carried out with t he Palomar Distant Cluster 
Survey (PDCS, Postman et al. 1996). Their pioneering work in- 
volved assuming a model for the spatial and luminosity distri- 
bution of galaxies in a cluster and in the field, and filtering the 
data using these models as templates. Using a likelihood analy- 
sis of the data, with cluster richness and redshift as free param- 
eters, the most likely cluster candidates could be extracted, and 
their redshifts estimated as a by-product of the process. The tech- 
nique is known as the matched-filter (MF) and is discussed in 
more detail in ij3| This method reduced spurious clusters due to 
projection effects compared with the more traditional techniques 
described above, but many still remained (discussed further be- 
low). The MF need only be used on photometric data from a sin- 
gle passband, but with an additional filter other techniques are 
possible. Algorithms using colour selection have been pro posed 
(e.g. iGal et alJ200(ilGladders & Yeei200fJ) . lGal et alJbOOd) used 
mild colour cuts to reduce contamination due to field-like galax- 
ies. Since elliptical galaxies are predominantly found in dense en- 
vironments (the morphology-density relation, |PressleJl98fj) . and 
exhibit only a narrow range of colours at a given redshift, in any 



environment, data can be filtered in c olour to remove g alaxies with 
colours incompatible with ellipticals. Gladders & Yee (2000) took 
the colour selection a stage further, placing very strict colour cuts 
in two-colour data, to only search for overdensities of galaxies with 
colours consistent with elliptical galaxies at a given redshift (see 
[J3j. This works because in all known clusters for which multi-band 
photometry exists (regardless of how the cluster was selected), a 
tight relation exists between the colour and magnitude of its early- 
type galaxies (e.g. Visvanathan 1978; Bower et al. 1992). This re- 
lation is clearly visible over small spatial scales (~ the size of the 
cluster core), as early-type galaxies are predominantly found in the 
central regions of a cluster (Dressier 1980). 

With an abundance of new wide-fie l d optical and X- 
rav JjonesetalJll998t IVikhlinin etallll998t iMason et alJl200Ct 
iRomer et alj|200d) imaging data, it is timely to directly compare 
clusters found using these different methods. To recap, X-ray se- 
lected clusters are required to have a hot, dense intracluster plasma; 
whereas optical selection just requires an overdensity of galaxies. 
To this end, we have undertaken optical and X-ray imaging surveys 
in exactly the same regions of sky. We refer to this as the X-ray 
Dark Cluster Survey (XDCS), as the project is specifically aimed 
at searching for the optically rich but X-ray underluminous clusters 
described in lBower et alll994llBower et alll997l 

A plan for the outline of this paper is as follows. The X-ray 
field selection, optical observations and data reduction are pre- 
sented in [J2| Two optical cluster detection algorithms are pre- 
sented in J3| the first uses only single band optical photometry, 
and is a variant of th e now widely-used Matched Filter algorithm 
IPostman et aljfl996l) ; the second utilises c olour information, and 
is our implementation of the algorithm of Gladders & Yce 2000. 
[J4| discusses measures of optical richness, and the construction of 
the optical catalogues. SjSJdeals with the X-ray selection of clusters. 
These samples are cross-compared with the X-ray sample in J6| 
Spectroscopic follow up for a subsample of X-ray underluminous 
clusters is presented in iJT] and finally our conclusions in §8. 



2 THE X-RAY DARK CLUSTER SURVEY 
2.1 Sample Selection and Observations 

X-ray imaging is observationally expensive. Thus we chose to 
base our survey on archival X-ray data, which is relatively in- 
expensive to follow-up with wide-field optical imaging. Essen- 
tially a random sample of deep extragalactic X-ray fields was re- 
quired. The ROSA T International X-ray/ Optical Survey (RIXOS, 
IMason et all2000l) provided an ideal list of such fields. Their sam- 
ple was constructed from ROSAT Position Sensitive Proportional 
Counter (PSPC) fields which had exposure times of at least 8ks. 
This ensures that sources at the intended survey flux limit (for a 
point source) of 3 x 10 -14 erg cm -2 s _1 (0.5-2.0 keV) lie sig- 
nificantly above the sensitivity threshold of every field. They also 
limited the choice of fields to those which have Galactic latitudes 
greater than 28° in either hemisphere, since RIXOS is primarily 
intended for extragalactic source studies. 

The archival ROSAT fields listed in TableQwere observed in 
the optical using the Wide Field Camera (WFC) on the Isaac New- 
ton Telescope (INT), La Palma. The observations were carried out 
in two runs, in June 1998 and January 1999. The median seeing 
for the two runs was around 1.0" and 1.6" respectively. Conditions 
for both runs were photometric. The inner >19 arc minutes of the 
PSPC fields were imaged to depths of V~24 and I~23 (50% com- 
pleteness). This is the region of the PSPC used for X-ray source 
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Figure 1. INT WFC Tiling Strategy. The circle represents the inner 19 ar- 
cmin radius of the PSPC field. Rectangles show the four CCDs of the WFC. 
Solid lines indicate the camera configuration in one orientation and broken 
lines show the camera rotated through 1 80° . The diagram illustrates how 
the field can be efficiently imaged in two pointings. 

identification in RIXOS, to ensure th e best X-ray i mage quality and 
the most accurate source positions jMason et alj|200(il) . For each 
band, two exposures were taken, rotating the camera through 180 
degrees, and offsetting the centre of the pointing in order to ensure 
optimum coverage of the PSPC, as shown in Fig.Q Hereafter, im- 
ages taken with the camera rotator angle set to degrees will be re- 
ferred to as A images; and those with a rotator angle of 180 degrees 
B images. The two independent observations of a large fraction of 
the survey area will provide important internal checks of galaxy 
photometry and of cluster catalogues generated from independent 
data covering the same area of sky. We shall use the term "mosaic" 
to refer to a single pointing (either A or B) comprising the four 
WFC chips. 

The WFC comprises four thinned EEV (2048 x 4100) 
Charge-Coupled Device (CCD) chips, at the prime focus of the 
2.5 metre INT. The science devices have 13.5 micron pixels 
(0.333"/pixel). Each covers an area of 22.8 arcmin x 11.4 arcmin 
on sky. The total sky coverage per exposure is 0.29 square degrees. 
A single exposure covers 76% of the ROSAT PSPC area. By us- 
ing two exposures, virtually the entire inner 38 arc minute diameter 
was covered: see Fig.Q 

2.2 Optical Data Reduction 

The data reduction was carried out using mostly standard IRAF 1 
routines, and is detailed below. 

2.2.7 Debiasing and Linearity Correction 

Bias frames were visually inspected for quality. Master bias frames 
were constructed for each night by taking the mean of all the 
good data using ZEROCOMBINE. Next it was necessary to cor- 
rect for the non-linear response (due to problems with the ADC) 
of the WFC. This was achieved by applying polynomial correc- 
tions taken from the Cambridge Wide Field Survey (WFS) web- 
page jhttp : / /www . ast . cam, ac ■ uk/~wf csur/^ . The tem- 
porally nearest fits were used. The August 1998 polynomials were 
used for the XDCS data taken in June 1998 and the October 1998 

1 IRAF is distributed by the National Optical Astronomy Observatory 
which is operated by AURA Inc. under contract with the NSF. 



values for January 1999 data (the non-linearity was found to re- 
main stable between October 1998 and August 1999). Following 
this correction, the camera residual non-linearity is estimated by 
the WFS team to be less than 0.5% of the sky level. 

2.2.2 Flatfielding and Defringing 

Flatfielding was carried out using master flats for each night. These 
were constructed from a median combination of all the science 
data in the V-band and from twilight sky flats in the I-band. The 
I-band science data were not used to construct master flatfields as 
the thinned CCDs suffer from fringing. The fringing patterns are 
additive, but their broad structure is largely stable with time. There- 
fore, all the I-band data for each chip were averaged together to 
make master fringe frames for each night. Although the shape of 
the lines is stable, the amplitude can vary considerably (although 
always at the level of a few percent of sky). Thus, a method was 
needed to scale the amplitude of the fringe mask to the amplitude 
of the fringes in each data frame, before subtraction. This was done 
with software kindly provided by Mike Irwin. The scaled fringe 
masks were then subtracted from the I-band science images. After 
this procedure the level of fringing is reduced to ~0.5 percent of 
sky, which is of the order of the residual non-linearity. 

2.2.3 Object Detection and Photometry 

The SEXTRACTOR package jBertin & Arnoutslll996T) v2.2. 1 was 
used to locate and classify objects in the optical data. Firstly the 
seeing was measured for each frame. This was accomplished by 
fitting Gaussian PSFs to all bright (> 10cr) detections, classified 
by SEXTRACTORas stars. 

SEXTRACTOR was run in double image mode on each pair of 
aligned V/I images using the V-band image as the detection image, 
and the I-band as the measurement image. This was done in order 
to obtain SEXTRACTOR MAG.BEST magnitudes to measure the 
total magnitude for I-band objects. Ideally one would like to use 
the redder passband for object detection as the number counts are 
shallower for redder bands (i.e. a lower background for cluster de- 
tection), but it was found that the residual fringing in the I-band 
posed a problem for object detection in a few frames, dramatically 
increasing the numbers of false detections in the most badly-fringed 
frames. Although the level of the remaining fringes is too small 
to affect the photometry (the photometric error is still dominated 
by the linearity correction at the bright end, and Poisson noise at 
the faint end), the background estimation method used by SEX- 
TRACTOR cannot model the fringes. High residuals pass above the 
SEXTRACTOR detection threshold and are classified as extended 
objects. As this method could not be reliably used on all the data, 
its use was decided against. The depth of the data is such that, by 
limiting the object catalogues to I — 22.5, few objects are missed 
which would have been detected in the I-band image (see magni- 
tude limits in Fig.|6j and the use of the V-band to perform detection 
is entirely one of operational ease. The survey is therefore I-band 
limited. 

After generating a list of object positions from SEXTRACTOR 
the x and y coordinates of objects in the V frame were logged and 
used to position the aperture for photometry. For each V/I frame 
pair, the frame with the better seeing was convolved to that of the 
worst, using Gaussian convolution. The IRAF task phot was used 
to perform aperture photometry to measure colours, using an aper- 
ture of diameter 2.6 x the seeing (e.g. Lilly et aljfl99ll) . on the 
aligned, convolved V and I frames. 
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Table 1. List of ROSAT Fields in XDCS Columns give: RIXOS ID of field; name of the target of the original ROSAT pointing; RA, Dec; exposure time of 
field in RIXOS survey; overlapping X-ray cluster survey (V - VMF, S - SHARC); exposure time of the VMF or SHARC pointing (to give an indication of the 
depth to which they could have searched for X-ray emission). 



RIXOS Target a (J2000) 8 (J2000) T exp Overlapping T exp (overlap) 

ID [hh:mm:ss.s] [dd:mm:ss] (ks) Survey (ks) 



R110 


LHS 2924 


14:28:43.17 


+33:10:45.47 


18.3 


V 


28.5 


R116 


NOWER2 


12:03:60.00 


+56:10:11.99 


30.1 


— 


— 


R122 


Meaty 


16:29:24.00 


+78:04:48.01 


38.5 


S 


34.4 


R123 


1116+215 


11:19:4.80 


+21:18:36.00 


25.0 


V 


32.2 


R126 


ON 231 


12:21:33.60 


+28:13:48.00 


10.4 


V 


12.5 


R133 


CYUMA 


10:56:55.20 


+49:42:0.00 


9.4 


V 


7.9 


R205 


PI 00578 


23:12:21.60 


+10:46:48.00 


10.3 


S 


9.8 


R211 


S5 0716+7 


07:21:52.70 


+71:20:23.99 


21.0 


S 


17.3 


R213 


IRAS 0759 


08:04:31.20 


+65:00:0.00 


8.4 


V 


6.4 


R216 


S4 0917+6 


09:21:36.00 


+62:15:35.99 


19.5 


V 


15.9 


R217 


1411+442 


14:13:48.00 


+44:00:0.00 


25.3 


V 


22.5 


R220 


RXJ1726. 


17:26:12.00 


+74:31:11.99 


10.6 


V 


8.1 


R221 


E0845+378 


08:48:19.20 


+37:40:11.99 


12.4 


V 


10.0 


R223 


CM DRA 


16:34:24.00 


+57:09:0.01 


47.5 


V 


37.3 


R224 


HZ43 


13:16:24.00 


+29:06:0.00 


34.9 


S 


18.3 


R227 


GDI 40 


11:36:33.51 


+29:47:60.00 


33.9 


V 


26.6 


R228 


GBS0839+3 


08:38:47.90 


+36:31:12.00 


11.0 


V 


9.2 


R231 


Survey Fi 


10:10:16.70 


+54:45:0.00 


16.8 


V 


14.4 


R236 


Q 1700+5 15 


17:01:23.90 


+51:49:12.00 


8.2 


V 


6.5 


R245 


H0323+022 


03:28:25.82 


+02:47:57.84 


25.7 


S 


24.5 


R248 


3C216 


09:09:33.50 


+42:54:0.01 


23.6 


V 


19.9 


R254 


MRK 273 


13:44:43.10 


+55:53:24.00 


17.1 


V 


28.1 


R255 


0755+37 


07:58:28.70 


+37:47:24.00 


16.0 


S 


15.5 


R257 


B2 0902+3 


09:05:31.10 


+34:07:48.00 


14.5 


V 


26.5 


R258 


1115+080 


11:18:16.70 


+07:46:12.00 


14.4 


V 


13.2 


R262 


520 


01:24:33.50 


+03:47:60.00 


13.9 


V 


12.0 


R265 


B2 1308+3 


13:10:28.70 


+32:20:59.99 


13.0 


V 


7.6 


R268 


MRK 463 


13:56:2.30 


+18:22:12.00 


11.6 


V 


18.3 


R272 


3C 371 


18:06:50.40 


+69:49:12.00 


10.5 


S 


8.0 


R273 


1040+123 


10:42:45.51 


+12:03:36.00 


10.2 


V 


8.4 


R274 


1404+226 


14:06:21.60 


+22:23:60.00 


10.1 


V 


6.7 


R278 


MKN 789 


13:32:24.00 


+11:06:36.00 


9.6 


V 


9.1 


R281 


IIIZW2 


00:10:28.70 


+10:58:12.00 


9.1 


V 


16.8 


R283 


1H 0414+0 


04:16:52.70 


+01:05:24.00 


9.0 






R285 


PSR 0940+ 


09:43:43.20 


+16:31:12.00 


9.0 


V 


8.1 


R287 


MKN 40 


11:25:36.00 


+54:22:48.00 


8.8 


V 


7.7 


R292 


GLIESE 70 


01:43:21.50 


+04:19:48.00 


8.7 


V 


5.4 


R293 


GD90 


08:19:47.90 


+37:31:12.00 


9.0 


V 


7.3 


R294 


KUV 2316+123 


23:18:45.0 


+12:36:00.00 


9.5 







Objects with a SEXTRACTOR CLASS-STAR index of ^0.90 
were taken to be galaxies. Detections with FLAGS^4 were re- 
jected. This means blended objects and those with near neighbours 
are kept, but those with saturated pixels, or corrupted data (e.g. due 
to boundary effects) are rejected. 

2.2.4 Photometric Calibration 

The photometric data were converted to the standard Cousins sys- 
tem using observations of several lLandoltl 1 19921) standard star 
fields each night. Due to the small size of these fields, only the 
central chip sampled the standard stars well. Therefore an internal 
calibration of the other three chips to the reference central chip was 
performed by comparing the relative sky levels in the chips for each 
observation. The flux difference was then converted into a relative 
magnitude offset. Different offsets were measured for each run, due 
to servicing of the instrument between the two runs, which changed 
the gains of the devices. The uncertainty on the absolute calibration 



derived from the scatter between Landolt stars, and repeat observa- 
tions of the same fields was estimated to be better than 0. 1 mags. 



2.2.5 Astrometric Calibration 

Relative astrometry between the chips was performed by Mark Tay- 
lor in Cambridge using A and B observations of one of our fields. 
This gives an internal astrometric solution converting chip coordi- 
nates into global camera coordinates. This solution is available on 
the WFS webpage. A precise external astrometric calibration is not 
essential, as clusters are very extended objects, so specifying a clus- 
ter centre to within several arcseconds is sufficient. Therefore, the 
pointing position of the telescope was used as the centre of the in- 
strument and the internal solution described was used to transform 
chip coordinates to sky coordinates. This gives an external accu- 
racy of around 3 arcseconds or better, but the internal accuracy is 
better than 0.5 arcseconds. For the purpose of cluster detection it 
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Figure 2. I-band galaxy counts for both XDCS runs and model counts 
used in derivation of B gc . Error bars are standard deviation from field to 
field. Both runs are found to be in good agreement, suggesting that the 
photometry for XDCS is homogeneous. Overplotted is the completeness 
model (relative to Metcalfe et aL 200j|) used to allow data to / = 22.5 to 
be used. Magnitudes are SEXTRACTOR MAGJ3EST magnitudes. I-band 
counts from the literature are overplotted, and found to be in good agree- 
ment. The completeness is modelled by a single-sided Gaussian of width 
1.90 mags, centred on I = 20.86. 



is the relative positions between galaxies (i.e. the internal solution) 
that is important. 



3 OPTICAL CLUSTER DETECTION ALGORITHMS 

Two different cluster detection methods were applied to the optical 
data. Both use positional information to search for overdensities 
in the galaxy catalogues, but they do this in different ways. The 
first uses only the I-band photometry, and looks for overdensities 
of galaxies which appear to follow the luminosity function of a 
galaxy cluster; the second includes the V-band data and uses the 
V-I colour to search for the colour-magnitude relation of early-type 
cluster galaxies. 



3.1 The Matched Filter Algorithm 

The "matched-filter" (MF) was pioneered by Postman et al. 1996, 
and modified by se v eral groups including Kawa saki et alj ( 1998); 
Kepn er et alj < 19991) ; Lobo et alj (200fj). Its principle is to assume 
that galaxy clusters follow some well-defined model, in both their 
spatial and luminosity distributions, i.e. some universal radial pro- 
file is assumed for the distribution of galaxies in a cluster, which 
can be projected into 2D. In the same way, some universal lu- 
minosity function can be assumed for its member galaxies. High 
resolution N-body simulations suggest that viria lised objects fol- 
low a universal density profile (NFW iNavarro et alj I1997T) : ob- 
servations show that cluster profiles are comp atible with such a 
profile, but also with s impler analytic fits ICarlberg et alj[l997l ; 
Lubin & Postmanlll996t . The luminosity function of galaxies in 
clusters and the field is well-fitted by a Schechter function with 
mild luminosity evolution I Yee & Lopez-Cruz 1999), or mild lu- 
minosity and density evolution iLin et all 19991) . This model can be 
scaled to any redshift. The observed radial profile scales with red- 
shift according to the angular diameter distance, D, to the object, 
given by 



D = (c/H Q ){q z + (go - l)[(2g f)2 + 1) 1/2 - l]/«g(l + zf (1) 

where c is the velocity of light, Ho is the Hubble constant, qo is 
the deceleration parameter, and z is the redshift 2 . The observed lu- 
minosity distribution scales according to the distance modulus for- 
mula m — M = 5log(d,M P c) + 25 + k + e with corrections for 
bandpass shifting due to the redshift of the source (k-correction, 
fc) and evolutionary corrections of the stellar populations (e). m 
and M are the apparent and absolute magnitudes of the galaxy, and 
the luminosity distance, d, in Mpc is a factor of (1+z) 2 times the 
angular diameter distance, D. The only other free parameter the 
model needs is the richness of the cluster (i.e. some parameterisa- 
tion of the number of galaxies it contains). Thus, a model for the 
observed properties of a galaxy cluster of arbitrary richness and 
redshift is obtained. The final aspect to be taken into account is the 
distribution of field galaxies. This model can be derived from the 
data itself. It is assumed they are randomly distributed in position 
- explicitly ignoring the correlation between the positions of pairs 
of galaxies (see £13.1. 4t . The contribution of cluster galaxies to the 
total number of galaxies in any dataset will be small, unless the sur- 
vey consists of small fields targeted at clusters. Hence, by studying 
the number density and luminosity distribution of the whole sam- 
ple, a model for the field galaxies can be deduced. The luminosity 
function in the PDCS method had to be modelled by a power-law 
due to assumptions made in the derivation (see paper for details) 
which is generally a good fit to the data, depending on the magni- 
tude range observed (e.g. Metcalfe et al. 2001, Smail et al. 1995, 
see also Fig.|2j. 

Astronomical data comprising positions and photometry can 
be searched for regions where the likelihood of the data fitting this 
cluster+field model is high. Since the cluster model is a function 
of richness and redshift as a by-product of the detection process, a 
most likely richness and redshift for each cluster candidate is ob- 
tained as a by-product. 

The Postm a~et alJll99"3 algorithm made several approxima- 
tions (detailed in their paper) which have been removed and treated 
more fully by later workers. For example, their main approxima- 
tion was to assume that the data (galaxies) could be binned in both 
position and magnitude in such a way that each bin had sufficient 
datapoints that their distribution was Gaussian. This was replaced 
by a more general treatme nt which a ssumed Poisson distribution 
of the data by iKawasaki et alj ll998l) . The other key assumption 
which has been followed by all subsequent works until lLobo et alj 
l2000l) is that the models predict a unique combination of spatial 
and luminosity distributions at a given redshift. The drawback of 
this approach is that if a cluster is slightly larger or smaller in an- 
gular extent than predicted for its luminosity distribut ion, its sig- 
nal is r educed and the probability of detection lessened. lLobo et all 
(2000) circumvented this problem by choosing the combination of 
spatial and luminosity profiles which independently maximised the 
signal. 



2 Throughout we assume H=64km s _1 Mpc — , go = 0.1 unless otherwise 
stated. This is to provide consistency with the stellar population synthesis 
models made available to us. Adopting the currently favoured cosmological 
parameters of: Q m = 0.3, Qa = 0.7 and Ho = 70 kms — 1 would affect 
the derived luminosities by SL4% and inferred linear sizes by %2% over the 
redshifts considered. 
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3.1.1 A New Matched-Filter Algorithm 

The algorithm pre sented here is closest in spirit to the technique of 
lLobo et all l2000l) . in that the assumptions for the distributions in 
radial profile and luminosity have been decoupled. This method of- 
fers many advantages for this project, the main one being that it is 
unnecessary to assume a characteristic physical size for the model 
cluster, a priori, which will obviously help if unrelaxed systems 
have larger angular extent than virialised systems of the same rich- 
ness and redshift (as may intuitively be expected). This also offers 
some computational rewards which will be explained below. 
The assumptions for this model are: 

• Field galaxies are distributed randomly over the sky (the two 
point galaxy-galaxy correlation function is explicitly ignored), and 
that their magnitude distribution has the same shape throughout the 
survey but changes slightly in normalisation, from field to field (see 
Fig.HJ. 

All magnitudes in this section refer to the I-band. In principle 
any photometric passband can be used, but as red a band as possible 
is desired. This is due to the fact that the field galaxy counts steepen 
toward shorter wavelengths, so the contrast between the cluster and 
the field is greater at longer wavelengths (e.g. power-law slopes of 
0.2 8 and 0.40 were m easured in the I- and V-bands, respectively, 
bv ISmail et alll995l) . 

• Galaxy clusters appear as overdensities in this background 
distribution, and their visibility can be enhanced by filtering the 
galaxy catalogue with a Gaussian filter, the size of which is given 
by the typical sizes of galaxy clusters from the literature. 

• Galaxy clusters follow a Schechter (Schechter 1976) Lumi- 
nosity Function (LF), with fixed faint-end slope, and the normali- 
sation is given by the amplitude of the overdensity (i.e. the cluster's 
richness). The typical apparent magnitude m* of the LF is a func- 
tion of redshift. 

The maximum-likelihood estimator, C, o flCashH 19791) is then 
applied to the data. 



3.1.2 Implementation 

The algorithm was run on each mosaic separately. First, filtering 
of the spatial information was performed. The x and y positions of 
galaxies brighter than magnitude 22.5 were read in, and the mo- 
saic filtered with five Gaussian filters of different widths. The stan- 
dard deviation of each filter was taken from lLoboetalji2000h . The 
widths shall be referred to as W n for the n th filter (to avoid con- 
fusion with standard deviation as, later), but are equivalent to the 
Gang in their paper. The widths, W\,...,W$, ranged from ~0.35 
to ~ 1 .42 arcmins in steps of these represent the typical core- 
widths of clusters in the redshift ran ge ~ 0.2 to 1 .0. A cut-off ra- 
dius of 3 W n is used. Unlike lEobo et alj l2000l) . instead of using 
a regular grid, the positions of the galaxies themselves were used 
as the grid to centre each of t he Gaussian fi l ters. T his adaptive-grid 
method was also adopted by Ke pner et alJ 1 19991) and has the ad- 
vantage that it ensures adequate resolution in the core of a cluster, 
and saves computational expense by performing few calculations 
where the galaxy density is low. For each spatial filter the mean and 
standard deviation of the filter amplitude was calculated (the ampli- 
tude follows a Gaussian distribution), and all five filters normalised 
onto the same system (by subtracting the mean and dividing by the 
standard deviation). 

Peaks were then found in each filter, by sorting the list of 
signal amplitudes, retaining the highest value, and then searching 
down the list, removing detections which fell within a radius W„ of 



the peak, and retaining the next highest value which did not. Val- 
ues lower than a minimum threshold of 2.5 o were immediately 
rejected. 

The peaks from the five filters were then sorted and cross- 
correlated. If a peak was detected in more than one filter, the highest 
amplitude was retained and the duplicate detections removed. Two 
peaks were considered to be the same object if the distance between 
them was less tha n the mean of their scales (i.e. (W n + W^)/2) 
{ Lobo et al. 2000). This resulted in a single list of peaks, each with 
an associated scale (the filter width, W n , in which the highest signal 
was detected). 

A richness estimate of the candidate was then required, for 
use in the maximum-likelihood estimation. As a first pass estimate, 
the number of galaxies within a fixed angular radius was taken, 
for all candidates, regardless of its associated W n . (The decision 
to use a fixed angular search cell is explained below.) This number 
then had the number of background galaxies, scaled to the same 
area, subtracted from it. The background galaxy density was found 
by counting the number of galaxies in an annulus of radius 3 x 
W„ to 15 x W„. The W n value was used to ensure the annulus is 
sufficiently far from the cluster core to avoid contamination with 
cluster members. 

The importance of using a local estimate of the background 
can be seen by looking at the field to field variations in our data in 
the number counts in Fig.|2| These variations are due to a combina- 
tion of residual offsets in the photometric calibration and intrinsic 
cosmic variance. The local background number density was also 
used to re-normalise the expected number counts locally, for use in 
the maximum-likelihood calculation. The cumulative counts were 
used at 1=20.5, two magnitudes brighter than the limiting magni- 
tude, to ensure both a high number of objects and high complete- 
ness. 

In estimating galaxy number densities, the geometry of the 
mosaic field needed to be taken into account. To compensate for 
border effects, where the detect cell starts to fall off the edge of 
the field, a weighting function was constructed, taking account of 
the fraction of the detect cell area lost. This requires caution, as 
upweighting the signal from a few galaxies is likely to result in 
increased spurious detections, due to the uncertainty from using 
fewer galaxies. Thus a cut was made, rejecting candidates where 
the fraction of the area lost to borders is > 0.20. 

The Cash C statistic (below) was then applied to the data 
within a radius of 2.5 x Wi . Most other MF algorithms use a search 
radius fixed in physical units at the estimated redshift of the cluster, 
and lLobo et alJ j2000) use the radius which maximises the signal. 
Early experimentation with simulated clusters showed that just us- 
ing data within a fixed radius (of the smallest filter) was adequate 
and this is done for the sake of simplicity and computational speed. 
Since the same galaxies always enter the maximum-likelihood cal- 
culation, this makes the calculation much more stable. It also meant 
that a fair estimate of the likelihood could be found by a simple 
application of the Cash statistic, without recourse to bootstrap re- 
sampling the dete ctions to determine their significance, as needed 
by the lLobo etall <2000l) method. 

The results were sorted in order of increasing C (decreas- 
ing likelihood), and overlapping detection s removed using a 2D 
friends-of-friends groupfinding algorithm l Huch ra & GelleJl982h . 
The groupfinder started with the most significant point and searches 
within a fixed radius (the linking length) for another point. If a 
point was found, then the search was repeated within the same ra- 
dius around this new point. The search continued, linking all points 
within the linking length of a neighbour, until no more points could 
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be linked. Thus only the most significant candidate was retained 
and all linked neighbours were removed. This method was found to 
work better than just removing candidates within a fixed radius of 
each other, as this latter approach tended to either remove too many 
(unassociated) candidates (if the rejection radius was too large) or 
leave multiple detections of the same candidate around the periph- 
ery of a rich candidate. The friends-of-friends algorithm is a more 
natural method for associating related points. Through experimen- 
tation on fields with known clusters, a linking length of 500 pixels 
(~2.8 arcmin) appeared optimal. Finally, the distance between the 
highest likelihood point (the candidate centre) and the most dis- 
tant point from it joined to the group was recorded. This distance 
was then used as a characteristic radius to estimate the extent of 
the group. This will be important later for matching up overlapping 
candidates. 
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Figure 3. Distribution of normalised amplitudes from spatial filter. The dis- 
tribution follows a Gaussian, with a high-end tail which contains cluster 
candidates. 



3,1,3 Maximum-Likelihood Estimation 

Cash 1 1979) originally developed the maximum-likelihood method 
for application to general Poissonian problems (see original paper 
for full details). The Cash statistic, C, is given by equation|2| 



(7 = 2 IE 



(2) 



For the application here, E is the expected number of galaxies 
per unit area per unit luminosity. The number of galaxies can be 
broken up into cluster and field. Now, observing over a given area 
and luminosity range on the sky, E — E{9, to; z c ) where 8 is the 
angular radius of the search area, to is the galaxy magnitude, and 
z c is the redshift of the cluster. Thus the number expected within 
these spatial and luminosity ranges is: 



E = 



[Acj> c i(m) + b(m)] dQdm 



to 
and 



li — [A<f} c i(m' n ) + b(m' n )]dQdm 



(3) 



(4) 



where each i is a data point. 

dfi and dm are elements of solid angle and magnitude, respec- 
tively. In most MF algorithms this is modelled as a power-law, as 
is necessary in Postman et al. 1996 original implementation. How- 
ever, there is no reason to assume this model in our maximum- 
likelihood approach, and so the background number counts were 
taken from the data, with a model for incompleteness, relative to lit- 
erature counts, and a local normalisation (explained above). b(m) 
is the model for the field galaxy counts, as a function of magnitude; 
<j> c i (m) is a model for the cluster contribution. The lower limit of 
the integral was replaced with 1=16 in practice, as saturation sets in 
around this point, and there are so few galaxies this bright within 
the whole survey that number counts could not reliably be com- 
puted. A is a parameterisation for the richness of the cluster, such 
that the number of cluster galaxies n c — A(j> c i , with A normalised 
by 



A(j) c i(m)dm 



(5) 



The luminosity distribution, <f> c i, can be modelled by the 
Schechter function l Schech teJ 19761) : 

(j} c i(m) = 0.92(/>*exp{-0.92(a+l)(m-TO*)-exp[-0.92(TO-TO* 



In practice, one can determine A by measuring the excess 
number of galaxies in the search cell (see below), and so the most 
likely value of to* can be sought. Matched-filters are usually used 
to provide redshift estimates of clusters, and each trial to* value 
can be thought of as a matched-filter redshift estimate zmf, using 
some relation between M* and to* (e.g. |Cplless 1989). The pre- 
dicted ma gnitude of a passively -evolving M* elliptical, from the 
models of Koda ma & ArimotcTl 19971) was used here, for consis- 
tency with the CMR method, explained in EI3.2I Most implemen- 
tations of the MF assume some radial profile for the model cluster 
fixed in metric coordinates (e.g. lPostman et alll996l : lKepner et all 
1999). This then means that dQ, = dQ(zM F), where zmf is the 
Matched-Filter redshift. Eobo et alj ^20001 adopted the novel ap- 
proach of decoupling the assumed radial profile from zmf, and 
just used Gaussian profiles of several different widths. Here, this 
is taken a stage further and a fixed angular size of detect cell dQ 
was used. One simply changes the parameterisation of radial shape 
and cluster richness to be contained within the A coefficient, which 
becomes the number of cluster galaxies within a fixed angular area. 
We estimated this from the number of excess galaxies over the lo- 
cal background value. Fixing dQ. made the C statistic operationally 
easier and more stable. 



3.1.4 Simulations 

In order to test the accuracy, completeness, and spurious detection 
rate of the cluster-finder, an extensive set of simulations was run. 
First, a cluster model was required. The fiducial cluster required a 
model for the luminosity and spatial distribution of galaxies, since 
these are used by the detection algorithm. The spatial profile is 
given by the density profile of Navarr o_et alJl 19971) . projected into 
2D using the prescription of Bartelmann 1 1996). 

The luminosity profile was taken from the same Schechter 
function used by the detection algorithm. This employed the Mj=- 
20.68 given by the stellar population synthesis models used (Ko- 
dama et al. 1998); we adopted alpha=-l.l (as used by Postman et 
al. 1996. These authors noted that varying alpha between -0.6 and 
1.6 only alters the FWHM of the luminosity filter by %\5% rela- 
tive to the nominal alpha=-1.10, and has a minimal effect on the 
detectability of cluster candidates). The normalisation of the LF is 
varied, and the number of galaxies brighter than 7713 + 2 wit hin an 
Abell radius counted to give the Abell Richness Class (ARC. lAbelll 
Il958fr >. Clusters from ARC to 3 were simulated. The cluster mod- 
)]}(fijs were generated in physical coordinates and then transformed to 
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Figure 4. Matched-Filter Accuracy. Recovered estimate of m* against sim- 
ulated redshift. The dashed line shows the model used for the m* -z relation. 
Each point is the mean of 100 simulations. The points for the different rich- 
ness classes are offset slightly in z s i m , in the plot, for clarity. Error bars are 
1 a standard deviation between all simulations. 



different redshifts. For the distribution of field galaxies, points were 
put down randomly over the field of the WFC mosaic. Each point 
then had a magnitude assigned to it, extracted from the observed 
number counts. The number of galaxies in each realisation was al- 
lowed to vary according to the range of surface densities seen in the 
data. 

For each richness class of cluster, 100 realisations of cluster 
and field were produced at each redshift interval from z=0.2 - 1.0 
in 0.2 steps. These mock fields were then passed to the detection 
algorithm. The results are plotted in Fig.|4| 

The completeness was assessed from the same simulations. 
For every simulated cluster, the cluster was considered correctly 
recovered if a candidate centre lay within 2.5W\ of the simulated 
cluster centre, and its Cash C value lay below the threshold cutoff 
(explained below). 

To assess the number of spurious candidates detected, the al- 
gorith m was run on simulated blank fiel ds. Other author s (e.g. 
lKepneretallll999t iPostman etai]|l996t lLobo et alJ l200fj) have 
used random realisations of their data to represent blank (i.e. 
cluster-free) fields. However, this is likely to underestimate the 
false-positive rate, since the positions of galaxies on the sky are 
correlated. In order to account for the clustering, mock fields were 
generated in such a way that the positions of points obeyed the 
observed two-point correlation function, uj(0) - a measure of the 
number of galaxies at a given ang ular separation 8 - usually mod- 
elled as a power-law in 9 IPavis & Peeb les 1983). Such fiel ds were 
genera ted using the iterative tree technique of ISoneira & Peebles! 
1 1978). This was implemented using code kindly supplied by Ian 
Smail. lo(9) is further discussed in f|4] 

Mock fields were generated to match the geometry of the 
WFC, and the MF algorithm applied. The threshold for the Cash C 
statistic was found by experimentation until a reasonable compro- 
mise was found between completeness and false-positive detection 
rate. The rates for the final threshold are plotted in Fig. [5] A value 
of Cthresh = —155 in the units set out in the previous section was 
chosen. 

To summarise, the matched-filter uses only the I-band galaxy 
positions and magnitudes to attempt to find systems with luminos- 
ity functions resembling those of galaxy clusters. This particular 
algorithm is designed to generously join points likely to belong to 



a given overdensity together to allow for the possibility of irregu- 
lar, extended clusters, perhaps as yet unrelaxed. The performance 
of this algorithm is compared with simulated data in Figs.|4]and|5] 
Simulated clusters of various richness classes were generated. The 
accuracy of the estimated redshifts is worst for the poorest clusters, 
but even for these it should be better than Az=0.1 for redshifts less 
than about 0.7. Hereafter the accuracy decreases, as a large fraction 
of the galaxies drop below the completeness limit of the survey. 
Again, for redshifts less than ~0.7, the MF algorithm should find 
all clusters of Abell richness classes (Fig.[5}- The fraction of false 
detections is essentially zero below this redshift and rises (most 
rapidly for poorer clusters) hereafter. Thus, this algorithm should 
essentially recover all ARC>0 clusters with negligible contamina- 
tion below z~0.7. 

3.2 The Colour-Magnitude Relation Algorithm 

The CMR finder used was based on the Cluster Red Sequence al- 
gorithm o f Glad ders & Yeel fcOOOh . Their method is directly appli- 
cable to the XDCS data set, as they tested the algorith m on V- an d 
I c -band data of the CNOC2 field redshift survey <Yee et all200fl) . 
The CNOC2 survey comprises four fields of similar area and depth 
to each of the XDCS fields, although the total area of XDCS is an 
order of magnitude larger. The algorithm works by first filtering 
the data, leaving only those which are compatible with galaxies be- 
longing to a model colour slice in colour-magnitude space. Then 
the method proceeds in a similar manner to the previous methods 
- convolving the data points with a kernel and performing density 
estimation. However, there is now the added complication that the 
overdensity finding has to be done in 3D. 

3.2.1 Model CMRs 

The passive-evolution models of Kod ama & Arimotcl ll997h . with 
the cosmology Ho = 64 kms -1 Mpc -1 , and go=0.1, and a forma- 
tion redshift of Zf — 4.4, were used. These models reproduce the 
evolution of the CMR for clusters to z>l iKodama etalll998l) . A 
redshift is selected and the model colours as a function of magni- 
tude for this redshift extracted. A colour slice of width compatible 
with the scatter in the CMR is taken around this line. The slices are 
selected in colour space and constructed in such a way that each 
overlaps by half the width of the next slice, in order to ensure that 
cluster CMRs are not lost between adjacent slices. This leads to ir- 
regular redshift spacing (e.g. Fig|6j. Slices were chosen between 
V - I c = 1.4 and V - I c = 2.7. Bluer than this limit and the 
4000A break passes below the limit of the V-band filter, and red- 
der than this limit and the colour errors become unreasonably large. 
The model slices used are shown in Fig|6| 

Each of the 24 slices illustrated is confronted with the x,y- 
position, colour, colour-error and total I c magnitude data in turn. 
Each galaxy is then given a weight which is the likelihood that for 
the given V - I c , A(V - I c ), MAG_BEST(I C ), the galaxy belongs 
to the model CMR slice (errors in the I c magnitude are ignored 
as the CMRs are virtually horizontal). This weight shall be termed 
the colour weight. In practice, many galaxies are so far from the 
colour slice that their colour weights can safely be set to zero, thus 
galaxies with colour weights of less than 0. 1 (i.e. 10 per cent prob- 
ability of belonging to the CMR slice) are ignored. This is done for 
computational efficiency. 

As mentioned above, the aim is to run kernel density estima- 
tion on the data, using colour-based weights to amplify the sig- 
nal from cluster galaxies. As can be seen from the number counts, 



An Optical Survey for X-ray Dark Clusters 9 



ARC 3 
■■• ARC 2 

- □ ARC 1 
^> ARC 



Figure 5. Matched-Filter Completeness (left). For the same simulations as Fig. 4, the fraction of correctly-recovered clusters (see text) was calculated. 
Matched-Filter Spurious Detection Rate (right). As for adjacent plot, but now the points represent detections of cluster candidates in a blank but correlated 
field. See text for details. 




Figure 6. Model colour slices used in the CMR finder. Thick lines show the 
CMR at the redshift given to the left; stars illustrate the position of M* ; and 
dashed lines show the \a scatter in the CMR, bounding the slice. Colour 
slices run from M*-l to M*+3 (to a limiting magnitude of I c = 22.5). 
Alternate slices are shaded differently for clarity. The 50% completeness 
limit of the photometry is also shown. 



Figure 7. The redshift resolution of the CMR-finder. The binwidth in red- 
shift of the model slices (spaced constantly in colour), is shown as a func- 
tion of redshift. This illustrates that the method offers greatest sensitivity 
at z rsj 0.3, and the binwidths increase rapidly above z ~ 0.5. Also, in- 
creasing colour errors and incompleteness lead to increasing uncertainty in 
redshift estimation at the high end. 



the numbers of galaxies at faint magnitudes grows rapidly. Thus, 
if just the colour weights were used, spurious detections would be 
caused due to some fields containing large numbers of faint objects 
(many of which would have the same colour as the CMR slice). Put 
simply, brighter (and rarer) galaxies are more powerful diagnostics 
of cluster members. Hence, it is necessary to also apply magni- 
tude weights to weight brighter objects more heavily, within each 
colour slice. The form of this weighting function was determined 
bv lGladders & YedfeOOOh for the CNOC2 data. This function is the 
probability that a galaxy of a given magnitude is a cluster galaxy. 
It could be derived from theoretical models, but requires the cluster 
galaxy LF; the space density of clusters and its evolution; and the 
field galaxy counts: it i s simpler to deduce internally from the data. 
Glad ders & YedfeOOfJ) show that whether they assume 2% or 20% 
of all their galaxies lie in clusters, the fit to the weighting func- 
tion only differs in linear slope by a factor of 1.5 (although there 
is considerable scatter about the relation). Bearing this in mind, the 
function chosen here is a fit to the result in their Fig. 5. 



P(M) = 



0.55 

-0.08(M- 



M*) + 0.55 



(M* 
(M* 



-KM < M*) 
< M < M* + 3) 



(7) 



Once colour weights have been assigned, each galaxy is given 
a total weight for each colour slice which is just the product of the 
colour weight and the magnitude weight. The next step is to smooth 
the data with a kernel and estimate the density of the weighted 
points. In a change from the previous method, a regular grid is cho- 
sen. This makes several later stages computationally easier. A grid 
fixed in physical size (for the above cosmology) is constructed with 
the pixels spaced in intervals of 0.125 h _1 Mpc. The kernel chosen 
also differs from the G aussian kernel used in the above methods. 
Gladd ers & Yed l2000l) chose to use an exponential kernel of the 
form k(r) — Ae^ -1 ' 965 ^ where A is a normalising constant (al- 
though this is unnecessary, as a further normalisation step is car- 
ried out later in the algorithm, and so A is ignored here) and r is 
the physical distance between galaxies at the redshift of the colour 
slice, in units of 0.33 h _1 Mpc. They chose this kernel as it ap- 
proaches the shape of the NFW profile at intermediate radii (for the 
value of -1.965 chosen) and is constant provided that r is given in 
units of the NFW scale radius (a value of 0.3 3 h _1 Mpc is suggested 
by the CNOC1 survey JCarlbere et alll99d) ). 

Thus, running the above algorithm results in a series of grid 
points distributed over the field of view, each with an associated 
signal resulting from the convolution of the exponential kernel with 
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Figure 8. Histograms of 8ij values for CMR-finder. Three different colour 
slices are illustrated. The solid histogram shows the binned values for the 
whole survey. The thick, solid line shows the values for the bootstrapped- 
thresholded data, and the dotted line is the power-law extrapolation to the 
bootstrap. See text. 

the total weights. These signal amplitudes will be referred to as SijS 
in the notation of iGladders & Yee! 12000). Each colour (redshift) 
slice contains a different number of grid points (as the angular size 
of the field is fixed, but the physical scale at the redshift of the 
slice decreases with increasing redshift), and a different distribu- 
tion of S^s. The distribution changes as the fixed physical size ker- 
nel changes apparent size and the mean density of objects differs 
between redshift slices. Thus, the SijS need transforming into some 
standard measure of significance, correctly normalised between the 
colour slices. 

Several cluster detection algorithms <Gal et alJ l200Ct 
Lobo etalJ l200Ct IGladders & Yeel l200Ch have used bootstrap 
resampling techniques to assess the significa nce of det ections, 
and this was also done here. As noted by Gladders & Yee 1 2000), 
a direct application of the bootstrap is likely to be incorrect (as 
the data contains clusters and is therefore not independently 
distributed). So, exclusion cuts of 10% of the data at the high-Sij 
and low-Sij (to preserve symmetry) ends were performed; and 
each data point (comprising an x,y-position, colour, colour-error, 
and magnitude) was sampled, with replacement, until the original 
number of datapoints had been extracted. The bootstrapped 
datapoints were then run through the algorithm, resulting in a 
new distribution of bootstrapped 5ys. Each WFC mosaic had 
one bootstrapped realisation of its data made, as the process is 
computationally expensive. A power law fit of the high signifi- 
cance tail was extrapolated to the very rare, highest significance 
peaks. This was found to agree well with tests made for larger 
number of realisations. The high-5,;., tail (where the number of 
points is low) can be extrapolated well with a simple linear fit, 
rather than performing many more bootstrap resampli ngs. See the 
distributions illustrated in Fig. [8] and cf. fig. 7 of Gladders & Yee 
(2000). The probability that a given Sij occurs at random can be 
found by comparing the number of Sij's in a given range with 
the number in the same range in the clipped-bootstrap sample, 
P(Sa) = iv ^°°'''™^ i ;J 1 . These can then be stated as the 

X J ' "{"bootstrap! 

equivalent Gaussian sigma (denoted cr^) for convenience. 

The final step in this process is to extract the signif- 
icant peaks, and work out which peaks are associated (i.e. 
part of the same cluster). The above cry's form a datacube 
in x,y,z space, where x,y are the physical metric coordinates 



and z is the redshift of each slice. Gladd ers & Yeel <200fJ) 
used the clump-finding algorithm elf ind of IWilliams^i^n 
1 1994) to extract significant associations from the data. A user- 
friendly IDL version of this algorithm was downloaded from 
|http : / /www . astro . ufl . edu/~williams/clf ind/| 
and run on the data cubes using the parameters detailed in 
Gladd ers & Yeel fcOOd) . Briefly, the algorithm is a 3D friends-of- 
friends group-finder which also contours the data at fixed intervals 
and looks for clumps in this 4D space. The code was originally 
used with temperature maps in radio data, but the temperature can 
be replaced with the signal from the CMR-finder, and the method is 
identical. The highest peaks are identified first and traced down in 
contour levels, their friends above the minimum level being linked 
to them at each interval. Following this through, all points become 
joined into one clump as the lowest contour level approaches the 
noise within the data. From extensive simulations. IWilliams et all 
1 1994) recommend the d ata be contoured at in tervals of twice 
the rms noise in the data. IGladders & Yed l2000l) calculated this 
value to be \.\oij. Tests were performed varying this value. The 
resulting groups found were practically identical, but using a value 
of \Aoij seemed a slightly better choice. Using a lower value split 
off clumps around the periphery of higher significance clumps 
(described in more detail below). The peaks were traced down to 
the lowest possible contour level (Lloy). This level resulted in 
a total catalogue of ^1000 candidates. This number was reduced 
by setting a higher threshold later, by examining the repeatability 
of cluster detections in the overlapping data. A threshold of 4.8<r 
was found to result in a reasonable number (~200) of repeatable 
candidates, detected in the two independent images. This high 
value may suggest that the bootstrap estimate used may be an 
overestimate of the formal significance of candidates. It should be 
noted that changing the size of the high and low exclusion cuts 
used in the bootstrap realisations will change the absolute values 
of the significances. However, since the relative significance is 
correct, just selecting a subsample of the most significant systems 
is a perfectly valid approach. 

Thus a list of cluster candidates was extracted from the dat- 
acubes. One further stage was necessary to clean the resulting cat- 
alogues, as a number of candidates were found in close proximity 
to more significant candidates. 

These may be genuine groups infalling into larger clusters, or 
just spurious detections from increased noise around other candi- 
dates. Failure to remove these would result in the following prob- 
lem: when measuring properties (such as richness, see below) of 
the detected systems, if a cluster and poor group are superposed 
along the same line, very close together on the sky, then the effect 
on the richness measurement of the richer system would be min- 
imal; but the effect on the poorer system would be to catalogue 
another rich system (due to contamination from the richer cluster). 
Hence, a minimum distance in physical and redshift space was im- 
posed to prevent these duplicate detections, and only the highest 
peak within two cutoff radii (i.e. 8 times the NFW scale radius of 
0.33 h~ 1 Mpc) and two redshift slices retained. An estimate of a 
characteristic radius for each group was made (as for the MF al- 
gorithm) by taking the maximum distance between the candidate 
centre and the 3<r contour. 



4 RICHNESS MEASURES 

One of the simplest observables for a galaxy cluster, in opti- 
cal data, is its richness. The original richness classification of 
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I Abel] ! j 19581) has been shown to be subject to many biase s 
(e.g. lLucevlll983l : iKatgert et al1ll996t Ivan Haarlem et alJll997h . 
However, more recent e stimates such as the B gc parameter of 
lLongair & Seldne^ ^^9l) correlate well with cluster vel ocity dis- 
persion (e.g. lffill & Lilly! 199lllYee & L6pez-Cruzll99l . 



The unc ertainty in the B ac p arameter was computed using the 
formula from lYee & Lopez-Cm j <1999h : 



AB gc = (N net + 1.3 2 iV 6gli ) 1/2 

Bar. 



(9) 



B gc is explained in detail in lYee & Lopez-CruJ 1 19991) and refer- 
ences therein. It has been used by several workers, primarily in 
studi es of th e environments of radio galaxies (for recent exam- 
ples: I Andersen & Owenlll994l iMiller et allll999l) . In outline, it is 
found from the amplitude of the 3D two-point correlation func- 
tion. The 3D correlation function is difficult to measure; obser- 
vationally easier is the angular correlation function, uj{6). This 
is simply a measure of the number of galaxies at a given angu- 
lar separati on. This can be approximated as a power law u){6) = 
AggO 1 " 1 toavis & Peeblesll 19831 for example), where A gg is the 
angular galaxy-galaxy correlation amplitude. Now, fixing a refer- 
ence point as the assumed centre of the cluster, one can measure 
the two-point angular galaxy-cluster correlation function. Its am- 
plitude, Ag C , can be calculated by counting the excess number 
of galaxies (i.e. background-subtracted), within some radius, 0, of 
the cluster centre (N„ et = N to tai — N bgd ). Assuming fixed 7, 
A gc = (N net /N bgd )[(3 - f)/2)ei-\ 

B gc , the spatial amplitude, can be estimated via deprojection 
of the angular correlation func tion, a ssuming spherical symmetry, 
as given in Longair & Seldneil ll979t) : 

D~"~ 3 A c 

B gc = N bgd ^-tt77 77 (8) 

i 7 W|M (mo, z)\ 

where Nbgd is the background galaxy counts to apparent mag- 
nitude mo and ^[M{mo, z)] is the integrated LF of galaxies up to 
the absolute magnitude M, given by mo at the cluster redshift z. / 7 
is an integration constant arising from the deprojection (J 7 = 3.78 
for an assumed 7 of 1.77). D is the angular diameter distance to z 
(Eq uation!!). 

lYee & Lopez-Cruzl 1 19991) discuss extensively the effects of 
different assumptions/ measurement limits on B gc . The salient 
points are summarised here. If the assumption of a universal LF is 
not strictly correct, then the systematic uncertainty this introduces 
in Bg C is ~ 10%. Changing the parameters of the LF (slope and 
normalisation) results in < 20% differences in B gc if M* is in- 
correct by as much as 0.5 mags, and if a is incorrect by as much 
as ±0.3. Bgc is independent of the sampling area, provided 7 has 
been correctly chosen. B gc is insensitive to the sampling magni- 
tude limit if mum lies on the flat part of the LF (between M*+l 
and M*+2). The most important step is ensuring that the cluster LF 
and background galaxy counts are determined in a self-consistent 
manner. 

The model LF is the same as that chosen for the cluster in the 
MF algorithm. Fig.|2|illustrates how the model LF assumed for both 
the cluster and the field translates into field galaxy counts. The LF 
was integrated over 0.05 redshift bins from z = 0.00 to z = 2.00. 

The normalisation, <j>*, was fitted to the XDCS data. (j>* = 
0.0035 /i64 3 Mpc~ 3 was found. This is cons istent with the R-band 
value measured bv lYee & Lopez-CruJ ll999l) . after correcting the 
number density to their cosmology. The co mpleteness was mod- 
elled as XDCS counts/ literature counts (from Metcalfe et al. 2001 ) 
to Ii im = 22.5 (where the completeness falls to 70%). This factor 
was then applied to the expected counts for the B gc calculation (as 
well as to the MF algorithm, earlier). 



The factor 1.3 2 accounts for the clustered ( and so no n-Poissonian) 
nature of the background counts <Yee&Greerll987l) . 



4.0.3 The Le Measure 

The luminosity of galaxies on the CMR referred to as L_g (since 
the galaxies are primarily early-type), has been shown (for a limited 
sample) to correlate well with the X-ray temperature of the cluster 
( Smail et al. 1998). For a sample of the 10 mo st X-ray lum inous 
clusters in the redshift range z = 0.22 - 0.28. ISmail et alJ Il99^> 
investigated the homogeneity of the stellar populations of cluster 
early-type galaxies. One method they used was to compare mass of 
baryonic material locked up in stars in early-types (in the form of 
the luminosity of galaxies on the CMR) with the total mass of the 
cluster (using X-ray temperatures from the literature). They found 
a remarkably small scatter about this relation (« 17% compared to 
the «30% when Lxis used instead of Tx)- It should be noted that 
the sample spans a narrow range in redshift, and relatively narrow 
range in blue fraction (i.e. few galaxies statistically belonging to the 
cluster are not red), and mass. A large sample to characterise an em- 
pirical relation between Le and Tx(or mass) over a larger range of 
parameter space does not currently exist in the literature. However, 
the evolution of cluster mass-to-light ratios for a sample of 4 X-ray 
selected clusters over a wide range (0. 22<z<0.83) of redshift has 
been studied bv lHoekstra et alj 120021) . Using gravitational weak- 
shear measurements from HST images, they determined that the 
mass-to-light ratios in their sample evolve in a manner consistent 
purely with luminosity-evolution of the cluster early-type galax- 
ies. Thus, inverting this argument, measuring the rest-frame lumi- 
nosity of cluster early-type galaxies (corrected for passive evolu- 
tion) could potentially provide an estimate of the total cluster mass. 
Again it should be emphasized that the datasets on which these cor- 
relations were based are small and so the scatter in the relation is 
not well known. Furthermore, all the data came from X-ray selected 
samples, so the scatter may be further increased once optically se- 
lected clusters are included. 

For each cluster candidate, the colour slice from the CMR- 
finder in which the candidate was detected was selected. The galax- 
ies wit hin this colour sli ce, and within a radius of 0.45 Mpc (0.5 
Mpc in lsiiiail et alll99 8 cosmology) brighter than Mv = —18.5+ 
5 log h 1 Smail et al. 1998) were selected, and their apparent I-band 
magnitudes converted into rest-frame V luminosity, again using the 
stellar population synthesis models of Kodama & Arimotol ll997h . 
This magnitude limit is approximately 1.5 magnitudes fainter than 
L* at z~0.3. Background subtraction was carried out by calculating 
the number of galaxies in a surrounding annulus, scaled to the area 
of the cluster region, as above, and subtracting the corresponding 
luminosity, assuming these galaxies were at the same redshift as the 
cluster. The limits for the maximum and minimum luminosity were 
estimated by using all the galaxies whose photometric colour errors 
allowed them into or out of the colour slice, respectively, and sum- 
ming their luminosities in the same way. This gives error estimates 
in excellent agreement with assuming the error is entirely due to the 
error in the estimated redshift (by taking the redshifts of the next 
highest and lowest colour slices and recalculating the luminosities). 



12 D. G. Gilbank et al. 



4.1 Construction of Final Cluster Catalogues 

Both the MF and CMR finders were run on each WFC mosaic 
individually. Since each field possesses overlapping 'A' and 'B' 
data, the next step is to combine the candidates from the A- and 
B -rotations, for each algorithm. The MF technique is more straight- 
forward, so this will be discussed first. 

The MF catalogue was divided into two catalogues with differ- 
ent significance thresholds. The higher significance catalogue will 
be referred to simply as the MF catalogue, or the final MF cata- 
logue, if it is necessary to emphasize the distinction between this 
and the lower-significance catalogue: referred to as the full MF 
catalogue. The full MF is that using the thresholding described 
in £13.1.41 The final MF catalogue was produced by imposing a 
stricter Cash C cut (a value of -280, this time). The complete- 
ness and spurious rates with this threshold are comparable to those 
plotted in Fig. [5] for redshifts less than 0.7. Using higher redshift 
candidates just increases the number of spurious detections, and 
the X-ray data are unlikely to probe enough volume to detect clus- 
ters at these redshifts. Also, candidates with a MF group radius of 
zero were rejected. Such objects occur when only a single galaxy 
(and none of its near neighbours) lies above the Cash C thresh- 
old. This reduces the number of clusters detected to a more man- 
ageable number, ^200, of higher confidence candidates, whilst the 
full MF catalogue allows the list of lower significance candidates 
to still be retained. This may prove useful later, if, for example 
in cross-comparisons between catalogues, a candidate is not found 
with high significance in the MF catalogue; then the full MF cata- 
logue may be searched. 

Next, I-band WFC thumbnail images were produced for all 
candidates in the MF catalogue. These were inspected to see if a 
candidate was found due to spurious objects (e.g. satellite trails, 
haloes of bright stars). Those that were spurious were flagged and 
rejected from the catalogue. Finally, the MF catalogue compris- 
ing A- and B-rotation candidates was reduced to a single catalogue 
by searching for candidates which overlapped in the two rotations. 
Where this occurred, only the candidate with the larger group ra- 
dius was retained. This was found to be more stable than select- 
ing the highest peak Cash C value candidate, as the group radius is 
given by the extent of galaxies passing the Cash C cut; but the Cash 
C value noted for each candidate just comes from the galaxy with 
the highest individual value in the candidate. This can be thought 
of as favouring a larger "total likelihood" over that of a "peak like- 
lihood" for each cluster candidate. 

The procedure for producing the CMR catalogue was slightly 
more involved. This was due to the '3D' aspect of finding clus- 
ters with this technique. Whereas the MF finder just selects over- 
densities and fits the most likely redshift to the clump, the CMR 
finder can, in principle, detect projections of groups along the 
line of sight. The same first steps as for the MF were followed: 
I-band thumbnails were generated, spurious candidates rejected, 
and a higher and lower significance catalogue generated. The final 
CMR catalogue had a threshold of 4.8(7 imposed, as described in 
EI3.2I whereas the full (lower significance) catalogue allowed can- 
didates to be traced down to the lowest possible contour level with 
clumpf ind (i.e. 1.4cr). Candidates in the same rotation which 
showed more than one candidate overlapping (as defined by their 
group radii) were flagged as 'projection' possibilities. This check 
was performed on an individual frame basis to avoid the possibil- 
ity that a single candidate having a significantly different estimated 
redshift in the A- and B-rotations would result in one candidate 
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Figure 10. Final catalogues for this field. The thresholding described in the 
text has been applied. Dotted circle denotes MF candidate, solid line shows 
CMR candidate. Other symbols as for previous figure. 



being artificially classed as a projected system. The final CMR cat- 
alogue was produced by combining the rotations as for the MF. 

This provided the two main catalogues for the optical survey: 
the final MF catalogue and final CMR catalogue. These catalogues 
were then passed to the richness measuring algorithms described in 
[4] Finally, both catalogues were cropped to overlap with the X-ray 
data which is described below. To do this, only candidates within 
an annulus of 3 to 19 arcmins from the centre of the X-ray pointing 
were retained. At distances greater than this, the X-ray data are not 
useful (due to degraded resolution and sensitivity - see below) and 
the inner region was excised to avoid objects associated with the 
target of the X-ray observation. 



4.1.1 Summary of Catalogues 

In total, the final MF catalogue (constructed and trimmed to the 
PSPC field, as described above) contains 185 cluster candidates. 
The final CMR catalogue contains 290 candidates. The MF tech- 
nique fits the most likely value of M* in the range 17.0^Ic ^21.5. 
The bright limit is imposed by there being few galaxies this bright 
in the field, and at this redshift (z=0. 15) the angular diameter of 
clusters becomes so large that the contrast of the cluster against 
the background is greatly reduced. The faint limit (corresponding 
to z=0.9) is set such that the limiting magnitude is one magni- 
tude fainter than this M*, thus there are still many galaxies to 
which to fit a luminosity function. The CMR method is limited 
by the depth of the photometry in both bands (as illustrated in 
Fig.|6j, which leads to colour limits 1.45^V-I^2.65 corresponding 
to 0.159^z es t ^CO.703. In practice, the MF catalogue is cropped 
at the high redshift end to match the colour limits imposed by our 
choice of filters used with the CMR method. At the low redshift 
end, only a few clusters lower than the our CMR colour limit are 
found by the MF, so these are retained. 



4.2 Internal Check of Redshift Estimates 

To assess the accuracy with which an estimated redshift can be as- 
signed, an internal check can be performed comparing the redshifts 
estimated for candidates using the A- versus the B-rotation data. 
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Figure 9. Full catalogues in one field for MF (left) and CMR (right) algorithms. Points show galaxies with I-band magnitudes brighter than 22.5 (for B-rotation 

only, for clarity). Dashed lines denote the limits of the PSPC field (19 arcmins: outer radius, 3 arcmins: inner radius). Cluster candidates are outlined by points 

marking their group radii (defined in text). Symbols are: 

Left panel, MF candidates: filled circles - A-rotation; crosses - B-rotation. 

Right panel, CMR candidates: thick lines - A-rotation; thin lines - B-rotation. 

Those candidates which appear as single points have group radii of zero (see text). 



Due to the way in which the object catalogues are generated ( i|2.2> . 
different data are produced for the same region of sky using two in- 
dependent observations. For example, the main difference between 
object catalogues for the two rotations was the star/ galaxy classi- 
fication. Several objects classified as galaxies in one rotation were 
classified as stars in the other, and vice versa. The neural network 
classifier of SEXTRACTORuses both the FWHM of a source and its 
ellipticity to decide the nature of each source. The effect of FWHM 
differences was minimised due to the way in which the data were 
taken (observing the same A- and B-rotation fields sequentially) 
so that, unless the seeing is changing on very short timescales, 
the FWHM of point sources should be the same for the two ro- 
tations. Inspection of objects which changed class between the two 
frames showed that a slightly different measure of ellipticity was 
the primary cause. Overall, the level of star/ galaxy misclassifica- 
tion should be around the few percent level. From the spectroscopic 
observations presented in f0 two of the 87 redshifts measured (for 
objects brighter than I~20, classified as galaxies) were due to stars, 
or around 2%. The contamination is likely to be higher for fainter 
objects, where a lower signal makes shape parameters more diffi- 
cult to measure. To a lesser extent, the object catalogues between 
the two rotations differ due to cosmic rays, diffraction spikes, and 
differently deblended objects (as discussed in i|2.2> . 

A comparison of estimated redshifts between the two indepen- 
dent observations allows the effect of all these factors to be taken 
into account. This is one of the primary motivations for treating the 
repeat observations separately. Candidates for the comparison were 
selected in the following manner. For the MF catalogue, the full cat- 
alogue was compared with the final catalogue. For each entry in the 
final catalogue, if a single candidate from each rotation was present 
in the full catalogue, within the final catalogue entry's group radius 
(to avoid possible confusion with multiple matches), then the can- 
didate was selected. A similar procedure was followed for the CMR 
catalogue, with the added condition that the candidate must not be 
flagged as comprising projected groups (again to avoid confusion 
due to multiple matches). In both cases a limiting radius of 1 arcmin 
was imposed for the match, to ensure a high level of confidence that 



the same candidate had been selected from the two datasets. The 
comparison of the estimated redshifts from each rotation for each 
cluster detection algorithm is shown in Fig. 1111 Quantifying the 
bias and scatter in these relationships as: Sz — (za — zb)/(1 + za) 
(e.g. [Wittman et al. 2001), the mean value is -0.004 for the MF and 
-0.014 for the CMR algorithm; the standard deviations are 0.097 
and 0.081 respectively. This is somewhat misleading as the major- 
ity of the scatter from the CMR comes from a few outliers, and 
the majority of points show excellent agreement between the two 
independent observations. A large fraction of the outliers were de- 
tected in the final (z=0.70) colour slice in one rotation, and thus 
could easily be missed and associated with a less significant lower 
redshift clump in the corresponding rotation. The few other outliers 
can be understood in terms of marginal cases for projected systems. 
If each rotation detects two systems, the lower significance candi- 
date is measured as being more significant in the overlapping rota- 
tion, and the lowest significance system falls below the threshold in 
both cases, then a catastrophic failure of the redshift estimate would 
occur. This only appears to be the case for seven of the systems in 
the plot, at z est <0.69. The scatter in the MF estimate is intrinsi- 
cally large. Neither estimate shows any significant bias between the 
two datasets. 



5 X-RAY SELECTION OF CLUSTERS 

The X-ray data are archival images taken with the ROSAT Posi- 
tion Sensitive Proportional Counter (PSPC). Such observations are 
divided into two energy bands: "hard" (0.4—2.4 keV) and "soft" 
(0.07—0.4 keV). The background flux is particularly high at ener- 
gies below 0.5 keV and the sensitivity of ROSAT rapidly drops 
to zero above 2.0 keV. As a result, most X-ray cluster surveys 
use the hard band and cut its range down to 0.5—2.0 keV. All o f 
the fields were taken from the RIXOS survey jMason et alj200fj|) . 
which was an international campaign to follow up in the optical all 
X-ray sources in a sample of ROSAT PSPC fields above a point 
source flux limit of 3 x 10" 14 erg cm" 2 s" 1 (0.5-2.0 keV). Thus, 
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Figure 11. Comparison of estimated redshifts for the MF and CMR algorithms from the A versus B data. The left panel shows the MF estimated redshift in 
the B field versus the A field estimate of the same quantity. The right panel shows the A versus B estimated redshift for the CMR algorithm. 



a sample of clusters discovered in the X-ray in these fields have 
already been selected and confirmed. However, the algorithm with 
which these were found was optimised for point source detection, 
and not for locating extended emission, as expected for clusters of 
galaxies. Thus the RIXOS cluster catalogue is incomplete. This re- 
sulted in a claim for a deficit of high redshift clusters in the RIXOS 
sample (Castander et al. 1995), when other investigators found no 
evolution in the abundance of clusters (e.g. lNichol et alll997l) . 

The currently favoured technique for the detection of faint, 
extended sources in X-ray data is the wavelet method. Sources are 
detected by convolving the data with a kernel to enhance the con- 
trast between objects and the background, in the same way as de- 
scribed in i|3.1.2l for the optical algorithms. The difference with the 
wavelet method is that this kernel consists of a positive core and a 
negative outer ring (such that its integral over the x,y plane is zero). 
This means that slowly varying backgrounds which can be approx- 
imated by linear functions are completely subtracted. Furthermore, 
a wavelet transform of the data reveals sources bounded by a ring 
of zero values; the diameter of the zero-rings gives a measure of the 
angular extent of the source. In practice, a range of kernel values is 
used (as was done with the Gaussian filtering of the MF method), 
and these can be used to infer the source radi us. An instructiv e illus- 
tration of this technique is given in fig. 2 of Vik hlinin et alJ ll998l) . 

Given that several wide-field surveys have also made use of 
archival ROSAT data for the serendipitous discovery of c lusters 
(e.g. IJones et alj[l99i iRomer et al-lhoOCt IVikhlinin et allll998l) . 
it is natural to check if any of these overlap with the fields se- 
lected for XDCS. 29 out of the 39 fields w ere used in the 160 
square degree survey of I Vikhlinin et alJ Il998l) . This catalogue has 
the attractive feature that nearly all of the 200+ sources detected 
have been followed up in the optical, many possessing spectro- 
scopic confirmation. Furthermore, their spurious detections are also 
recorded in their paper, so all X-ray detected cluster candidates can 
be examined, and not just the optically confirmed ones. 19 likely 
false detections, arising from concentrations of point sources, were 
recorded, but none of these occurs in the XDCS fields.. Of the re- 
maining 10 fields, 7 were included in the Bright SHARC Survey 
I Rom er et al J 120001) . Both of these used wavelet detection algo- 
rithms in their construction. The SHARC catalogue has had a fairly 
bright ROSAT count-rate limit imposed (corresponding to a flux 
of approximately 10~ 13 erg s" 1 cm" 2 , or about an order of mag- 
nitude brighter than typical XDCS field limits) in order to reduce 



the numbers of clusters found, to make optical follow up achiev- 
able in a reasonable amount of time. Given this limit, 94 sources 
were found in 460 ROSAT fields. It is not too surprising, then, that 
in the seven fields overlapping with XDCS, no so urces are found. 
The SH ARC survey is not considered hereafter. The Vikhlinin et al. 
\ 19981) catalogue (VMF), on the other hand, contains 15 X-ray se- 
lected clusters in XDCS fields. This is the X-ray selected cluster 
survey with which the XDCS optical catalogues will be compared. 

The ROSAT fields observed are given in Table □ The VMF 
clusters in common fields are given in Table|2| 



6 COMPARISON OF OPTICAL AND X-RAY SELECTED 
CLUSTERS 

The VMF X - ray s elected clusters are listed in Table [2] 
Vikhli nin et alJ ^998) used several methods to "confirm" their X- 
ray cluster candidates and it is pertinent to comment on these here. 
Aside from the traditional method of requiring an overdensity of 
galaxies in the optical, they included another possible criterion 
which was that if an elliptical galaxy not included in the NGC cat- 
alogue lay at the peak of the X-ray emission then this should be 
considered confirmation. 

This latter point was designed to include "poor clusters and 
groups which fail to produce a significant excess of galaxies over 
the background". The authors state that it also helps to identify 
"fossil grou ps" in wh ich galaxies have merged to form a single cD 
( Pon manet alll994t Ijones et alj 2000 ) . Such systems appear to be 
as X-ray luminous as other bright groups or poor clusters, but with a 
high percentage of the optical luminosity arising from the dominant 
giant elliptical. The second brightest group member is a factor of 
10 fainter than the brightest (resulting in a gap of 2.5 magnitudes in 
the LF). Recent work estimates that such systems comprise 8—20% 
of all systems of the same X-ray luminosity I Jones et al. 2003). 

Vikhlinin et al. (priv. comm.) obtained optical follow up ei- 
ther from second generation Digitized Sky Survey (DSS-II) plates, 
or R-band (or sometimes B-, V-, or I-) CCD imaging on lm class 
telescopes. Long-slit spectroscopy was also obtained for some can- 
didates, usually for 2-3 galaxies per cluster, and always including 
the brightest galaxy. 

To quantitatively compare the optically selected catalogues 
with the X-ray selected clusters, the following method was used. 
For each of the final MF and CMR catalogues, cross-correlation 
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Table 2. VMF clusters in XDCS fields. Redshifts are given for all but one cluster. The type of redshift measured by VMF is given in the final column (p - 
photometric, s - spectroscopic) 



VMF 


RIXOS 


a(J2000) 


<5(J2000) 


Fx 




z 


Redshift 


ID 


Field 


[hh:mm:ss.s] 


[dd:mm:ss] 


10" 14 erg s — 1 cm -2 


IQ- 14 - ergs" 1 




Type 


j i 


R262 


01 -94-35 1 


-t-04-00-49 


75 


2.2 


0.27 


P 


62 


R221 


08:49:11.1 


+37:31:25 


14.7 


3.0 


0.240 


s 


69 


R248 


09:10:39.7 


+42:48:41 


8.3 


2.0 






73 


R285 


09:43:32.2 


+16:40:02 


23.1 


3.7 


0.256 


s 


74 


R285 


09:43:44.7 


+16:44:20 


21.2 


4.1 


0.180 


s 


84 


R231 


10:10:14.7 


+54:30:18 


21.0 


2.9 


0.045 


s 


86 


R231 


10:11:26.0 


+54:50:08 


20.0 


5.1 


0.294 


s 


94 


R133 


10:56:12.6 


+49:33:11 


12.9 


1.9 


0.199 


s 


97 


R258 


11:17:26.1 


+07:43:35 


6.1 


1.6 


0.40 


p 


99 


R123 


11:19:43.5 


+21:26:44 


5.5 


0.9 


0.11 


p 


131 


R265 


13:09:55.6 


+32:22:31 


9.0 


2.9 


0.290 


s 


132 


R265 


13:11:12.8 


+32:28:58 


46.7 


5.8 


0.245 


s 


150 


R254 


13:43:29.0 


+55:47:17 


17.5 


2.8 


0.11 


p 


181 


R223 


16:33:40.0 


+57:14:37 


3.5 


0.7 


0.239 


s 


194 


R220 


17:29:01.9 


+74:40:46 


17.3 


7.2 


0.28 


p 



Table 3. The nearest optically selected candidates to the VMF clusters. For each X-ray selected cluster, the nearest matching MF and CMR candidates' details 
are given. Candidates in parentheses were not identified in the final catalogues. See text for details. 

* - candidate matched at a separation greater than its estimated radius. 

** - candidate matched within its estimated radius, but greater than 2 arcmins. 

(p) - candidate flagged as exhibiting projection along line-of-sight. 

VMF MF candidate i-VMF Separation zjjf Az CMR candidate offset z-cmr Az 

ID ID (arcmin) ID (arcmin) 



11 


(mf"J012435 


.6+040107) 


62 


mf"J084914 


.5+373123 


69 


mfJ091049 


.4+425002 


73 


mfJ094350 


.5+164034 


74 


(mfJ094344 


.0+164500) 


84 






86 


mf"J101137 


.3+545036 


94 


mfJ105617 


.5+493237 


97 


mfJl 11726 


.2+074316 


99 






131 


mfJ131001 


.9+322110 


132 


(mfJ131113 


.2+322843) 


150 






181 


mfJ163334 


.2+571457 


194 


mfJ172845 


.5+743945 



0.270 


(0.328) 


(0.422) 


0.240 


0.678 


0.276 




(2.246**) 


0.484 


0.256 


(4.421*) 


0.351 


0.180 


(0.691) 


(0.293) 


0.045 






0.294 


1.698 


0.276 


0.199 


0.972 


0.157 


0.400 


0.306 


0.232 


0.110 






0.290 


(1.889*) 


0.437 


0.245 


(0.259) 


(0.422) 


0.110 






0.239 


0.853 


0.395 


0.280 


1.487 


0.484 



0.152 cmJ012437.8+040022 
0.036 cmJ084908.8+373036 
— cmJ091045.1+424955 
0.095 cmJ094329.3+163916 
(0.113) — 

0.018 cmJIOl 134. 1+545014 (p) 

0.042 — 

0.168 cmJl 11726.2+074319 

0.147 cmj 130954.0+322 137 

(0.177) cmJ131111.0+322825(p) 

0.156 (cmJ163337.8+571328) 

0.204 (cmJ172946.3+744238) 



0.807 


0.360 


0.090 


0.939 


0.190 


0.050 


1.596 


0.450 




1.035 


0.190 


0.066 


1.168 


0.330 


0.036 


0.258 


0.370 


0.030 


0.949 


0.270 


0.020 


0.664 


0.210 


0.035 


1.179 


(0.210) 


(0.029) 


3.474 


(0.190) 


(0.090) 



with the IVikhlinin et alJ Jl998^ catalogue was performed, retain- 
ing the nearest match to each X-ray cluster. If the X-ray cluster 
lay within the optical candidate's group radius, it was considered 
matched (the only caveat is that a minimum radius of 1 arcmin and 
a maximum of 2 arcmin was adopted, to ignore excessively large 
or small group radii, and also account for the uncertainty in the X- 
ray position). These matches are tabulated in Table|5| For the X-ray 
clusters with no matches from this process, the full catalogues for 
each algorithm were checked, to see if a lower significance candi- 
date is matched. Such matches are indicated in the table by paren- 
theses. All matches were then inspected visually and special cases 
are commented on. Typical data are shown in Fies l 1 2I & I 1 31 



6.1 Summary of Optical Candidates Associated with X-ray 
Clusters 

In terms of gross numbers with this simplistic matching, the final 
MF catalogue contains counterparts to 7 of the 15 X-ray selected 
clusters, and the final CMR catalogue contains 8 of the 10 in the 
redshift range probed by the algorithm. 

In the MF catalogue: VMF 11 is not matched in the final cat- 
alogue, but is matched in the full catalogue (just below the Cash 
C threshold). VMF69 and 73 were both further than 2 arcmins 
away from the nearest candidates, but lay within the candidates' 
group radii. Thus, these were treated cautiously, but visual inspec- 
tion showed large overdensities extending this far and thus the asso- 
ciation of these objects seems valid. VMF74 and VMF 132 were de- 
tected in the full catalogue, but did not make the higher significance 
cut of the final catalogue. VMF84 was undetected as its redshift is 
too low (0.045). VMF99 and 150 were also undetected. These fields 
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Figure 12. VMF62. Left panel: WFC I-band image with PSPC contours overlaid (images are 5 arcmins on a side and PSPC contours have been smoothed 
to 30 arcsec - the approximate PSF. These contours do not correspond to any particular significance level, and are simply in units of X-ray count rate). Right 
panel: CMD centred on VMF position. Filled circles show galaxies drawn from within 1 arcmin radius of the CMR selected candidate position; open symbols 
show galaxies drawn from an equal area, a further arcmin away. Note: these fixed angular radii are just chosen as a guide, and are not the same as the radii 
chosen by the CMR algorithm (which is colour dependant). Photometric error bars are only shown for filled points for clarity. Overplotted line shows CMR 
corresponding to CMR algorithm's estimated redshift (from M* — 1 to M* + 2). VMF62 is found in both the final MF and CMR catalogues. The associated 
MF and CMR candidates are listed in Tablel3l 




Figure 13. VMF86. Panels as for Fig.[l2] 

VMF86 appears to show a second overdensity even in the DSS image, and possibly a weak second X-ray source, although this was undetected by Vikhlin hi et alj 
1 1998) (A. Vikhlinin, priv. comm.). The second cluster centre is just visible on the extreme right of the I-band image (at Ri[-2.4,0.0] arcmin offset). The CMD 
shows the hint of a second CMR 0.4 magnitudes blueward of the main candidate, mainly indicated by open symbols due to its distance from the primary 
candidate. Thus, this appears to be, in fact, two distinct systems at very different redshifts. Indeed, this system was flagged by the CMR algorithm as being a 
system suffering from projection effects. 
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do not show overdensities of galaxies, and the clusters were possi- 
bly 'confirmed' using the single luminous elliptical criterion. The 
redshifts given are also below the expected detection range (both 
have z=0.11). VMF131 has a match at a distance of 1.9 arcmins: 
this is outside the candidate's estimated radius of 0.5 arcmins, but 
again visual inspection suggests the association is valid. To sum- 
marise, if the three lowest redshift X-ray clusters (VMF84, 99, and 
150) are excluded (z^O.ll), then the strict automated matching 
matches 6 of the 12 candidates. Visual matching suggests the re- 
covery rate in the final MF catalogue should be 91 12. Lowering the 
significance threshold allows the remaining X-ray clusters to all be 
detected (at the expense of a higher spurious rate). 

In the CMR catalogue: considering all except the 5 X-ray clus- 
ters at z^0.2 (for the reasons described above), only two are not 
immediately matched. VMF181 is matched in the full catalogue 
and is of high enough significance (>6cr) to be in the final cata- 
logue, but was 'cleaned' from the catalogue as it had a neighbour 
of higher significance. VMF194 shows a counterpart some distance 
(« 3.5 arcmin) from the X-ray cluster, but the X-ray position ap- 
pears to be at least an arc minute from a significant overdensity 
and CMR (the MF candidate at this position was only matched be- 
cause it has a large associated radius). This cluster has been the 
subject of extensive follow-up work by Vikhlinin and collaborators 
(A. Vikhlinin, priv. comm.) and was difficult to confirm optically. 
It has very extended X-ray emission (~2-3 arcmin) and the galaxy 
overdensity is similarly extended. Spectroscopic follow-up found 4 
out of 5 ellipticals in this field at z=0.213, and so they consider the 
cluster confirmed. Note that this redshift is in much better agree- 
ment with our CMR-estimated redshift than the initial photometric 
estimate of Vikhl inin etal] jl998). 

The CMR technique, furthermore, allows the possibility of 
distinguishing groups projected along the line of sight (the entries 
in Table [3] flagged with a 'p'). VMF86 is identified as two sys- 
tems (as suspected from the data, illustrated in Fig. 1131 : the more 
significant at a redshift of 0.330 and another at 0.230. The quoted 
spectroscopic redshift of VMF is 0.294; this is within Az=0.05 
of the most significant candidate. VMF132 also shows two pos- 
sible further groups, overlapping with the zcmr =0.21 cluster, 
at higher redshift: zca/h=0.37 and 0.65. Thus, to summarise, the 
CMR matches 8 of the 10 z>0.20 X-ray clusters immediately, and 
visual matching allows all 10 clusters to be matched. Another ad- 
vantage of this method is that it is able to disentangle projection 
effects: correctly resolving structure (which is obvious visually in 
the CCD images) in one field, and suggesting higher redshift groups 
in another. 

6.1.1 Comparison of Estimated Redshifts with VMF Redshifts 

The average bias in the redshift estimate, defined as (z spec - 
z p hot)/(l + z spec ), is 0.067 with a standard deviation of 0.066 
for the MF (using 8 spectroscopic redshifts from VMF); the av- 
erage bias for the CMR technique is -0.022 with a standard devia- 
tion of 0.028 (from 6 published VMF spectroscopic redshifts). The 
latter result compares very favourably with photometric redshifts. 
IWittman et alJ 1200 ll) find an average bias of -0.027 with standard 
deviation of 0.059 for photometric redshifts over a similar range 
using four photometric passbands. Gladders & Yee (2000) com- 
mented that redshift determination should include a step to renor- 
malise the stellar population models to the data with redshifts. This 
corrects systematic offsets such as mismatches between the model 
and real filters. Any remaining 'bias' would be due to photometric 
calibration errors, assuming universality of the CMR. 



6.2 Comparison of MF Catalogue with CMR Catalogue 

Now that both optical techniques have been compared with a spec- 
troscopic sample, it can be seen that the estimated redshifts from 
the CMR technique offer greater precision than the MF estimated 
redshifts. Thus, a cross-comparison of the two techniques can be 
made, using the CMR catalogue as a reference. The final MF cata- 
logue was cross-correlated with the final CMR catalogue to deter- 
mine cluster candidates in common. To avoid possible confusion 
from multiple associations of candidates, only MF candidates with 
a single CMR candidate within the former's radius were consid- 
ered. If the candidates' centres were separated by more than two arc 
minutes, they were excluded. Thus, only secure 'clean' matches are 
considered. Of the 185 final MF candidates, 62 show unique CMR 
matches (7 of these are flagged as line of sight group projections). 
A comparison between the estimated redshifts of these techniques 
is shown in Fig. l 151 The average bias and scatter (as defined earlier) 
in this relation are -0.066 and 0.106 respectively, although inspec- 
tion shows that this may equally be due to the MF redshift being 
randomly drawn from values between 0.3 and 0.5. This may be due 
to bias in the candidates selected for this comparison (those with 
'clean' matches between the CMR and MF candidates). A compar- 
ison of the MF estimated redshifts with those of spectroscopically 
determined redshifts (from the X-ray selected clusters in the next 
section) shows that the MF estimated redshift is not as bad as this. 

A further comparison is to consider the MF candidates not 
matched with CMR candidates. This was done by comparing the 
full CMR catalogue with the final MF catalogue and searching for 
MF candidates with no CMR matches within their radii (or 2 ar- 
cmins). 41 of the final MF candidates show no CMR counterparts 
at any significance level. Under the assumption that all genuine 
clusters possess a CMR and that this technique will find them, this 
can be used as an approximation to the number of spurious MF 
detections. This gives a false detection rate of around 22%. This 
is in general agree ment with est imates for other MF techniques 
of ar ound 30% (e.g. lHolden et alJI 19991) . 22% is a lower limit, as 
some of the CMR matches are of low significance. Assuming a 
spurious CMR r ate of around 5—10% (as seems more likely, e.g. 
Gladd ers & Yeell2000] : lGilbanldl200ll) would bring the false posi- 
tive rate of the MF into closer agreement with the 30% value. Us- 
ing the fraction of matched candidates flagged as projections (7 out 
of 62, above) comp ares well with the (spectroscopic) findings of 
iKatgert et alJ 1 19961) that around 10% of Abell clusters comprise 
two or more significant clusters, projected along the line of sight. 



6.3 Comparison of Optical Richness Measures with Lx 

Henceforth we consider only the catalogue generated by the CMR 
technique. We have just shown through comparison with the spec- 
troscopically confirmed sample of Vik hlinin et alJ 1 19981) and from 
internal comparisons within our data the superiority in terms of ac- 
curacy and reliability of the CMR algorithm over the MF approach. 

The following procedure was used to measure the X-ray flux 
for each optical cluster candidate. 

The relationship between each of the richness measures and 
X-ray luminosity for cluster candidates in the optically selected cat- 
alogues will now be presented. Since the vast majority of the opti- 
cally selected clusters have no X-ray selected counterparts, fluxes/ 
flux limit were measured at the positions of the optical candidates. 

Aperture fluxes were measured from the X-ray images. The 
peak X-ray flux within a 1.5 arcmin search radius was located, to 
provide the centre of the measurement. A 1.5 arcmin radius aper- 
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Figure 14. Comparison of the optical cluster finders' redshift estimates with Vikhlinin et al.'s redshifts. Open symbols are for VMF photometric redshifts 
with eiTor bars showing their estimated range; filled points are for spectroscopic redshifts. Dashed line shows the one to one relation, and crosses on this line 
indicate undetected clusters. These numbers are tabulated in Fig. [3] The CMR redshift appears to systematically underestimate the VMF spectroscopic redshift 
by fsO.03, in all but one case. This is for VMF86 which is in fact two systems, as indicated earlier. The more significant candidate happens to be the higher 
redshift one, but if VMF measured a redshift for the lower redshift system, then this too would be underestimated by a similar amount. 




Figure 15. Comparison of MF and CMR estimated redshifts. Filled points 
are unique matches; open points are flagged as line of sight projections in 
the CMR catalogue. Dashed line is the one to one relation. 



Figure 16. Comparison of our aperture X-ray fluxes with the wavelet fluxes 
of VMF98 for clusters in common with their sample. VMF IDs are indi- 
cated. 



ture was used. This aperture flux was then corrected to a total flux 
computing the ratio of total flux to flux within the 1.5 arcmin ra- 
dius aperture for a cluster beta model of typical parameters. This 
method is a compromise between choosing too large an aperture 
and increasing the chance of background contamination from point 
sources, and choosing too small an aperture and missing cluster 
flux. This aperture was found to give the best ag r eeme nt with the 
wavelet X-ray flux measured by IVikhlinin et alj ll998l) fFig 1161 . 
Also tried was the use of a standard maximum likelihood tech- 
nique to fit a cluster beta model to the X-ray surface brightness 
profile, but this gave largely poor fits, as the systems we detect are 
not well-matched to the standard model assumptions used (such as 
spherical symmetry). 

The ROSAT count rate in the 0.5—2.0 keV band was converted 
to a bolometric luminosity correcting for the observed Galac- 
tic hydrogen column density, assuming the CMR estimated red- 
shift and assuming a cluster temperature of 5 keV. The luminos- 
ity was then iterated once by assuming a fit to the X-ray lumi- 
nosity temperature re lation given by T x = 2.75(Lj, j x Msn) ' 357 
(a compendium from lReichart et alll999tlArnaud & Evrarcjl999t 



iMarkevitchll 19981 lAllen & Fabiar] |l998t) . This has little effect on 
the resulting luminosity. The background flux rate was determined 
for each object by the maximum likelihood method in which it 
was a free fitting parameter. The background flux rate was also 
measured for each field by placing 100 apertures randomly around 
each image and measuring fluxes in the same way. 3a outliers were 
rejected from these estimates to obtain the median and variance. 
The background values obtained by both methods agreed well with 
each other. The significance of each optical candidate X-ray flux 
measurement was determined relative to the variance of the back- 
ground flux measurements in each field. If the measurement was 
a greater than 3a event, this was classed as a detection. For other 
measurements, a 3a upper limit was found from the limiting flux 
for the field. The typical 3a limiting flux is around 6xl0~ 14 erg 
s _1 cm~ 2 . X-ray detections at ~^3a were visually inspected, and 
those showing contamination from an obvious bright point source 
were rejected from the analysis. 40 of the 290 apertures resulted in 
>3<t X-ray detections. 

A plot of each of the richness measures described in inversus 
the X-ray luminosity is given in Figs.[Hland[T9l 
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Firstly we consider the relation between the total early type 
galaxy luminosity, L_b and X-ray luminosity, Lx. The two quan- 
tities appear to be correlated, but with large intrinsic scatter. One 
concern about plotting two measures of luminosity against each 
other is the fact that the two are correlated through the distance 
to each object. In order to show that this is not producing the ob- 
served correlation, Fig. 1201 shows the ratio of the two luminosi- 
ties as a function of redshift. This distance-independent measure- 
ment shows that there is still an intrinsic scatter in the luminosities 
at each redshift interval. We use a Bayesian maximum-likelihood 
technique to fit the relations shown in Fig.s l 1 8lfT9l We assume that 
there is a power-law correlation between L e and Lx with a large 
intrinsic scatter (a), which we assume to be Gaussian in logLx, 
and is independent of L e . There are three model parameters that 
must be determined: the normalisation and exponent of the mean 
relation, and the scatter about the relation. The statistical problem 
is an unusual one because the upper limits far outnumber the de- 
tections. Furthermore, the scatter in the relation is much larger than 
the errors in the measurements. To simplify the fitting procedure, 
we therefore treat the measurement errors as a component of the 
scatter in the model relation. 

We tried three methods of fitting the data. Firstly, we can use 
only the X-ray detections. This gives Lx = lO" 0338 L E °' 94 , but 
this model relation fails to take any account of the large number of 
upper limits. Secondly, we can incorporate the upper limits by treat- 
ing the limit as a data point. We eliminate the data points flagged 
as contaminated by point sources from this fit. This gives a rela- 
tion that has lower normalisation and larger scatter. Our ML esti- 
mator gives similar results to conventional regression schemes in 
both these cases. However, while the second approach takes into 
account the upper limits, it does not allow for the possibility that 
the actual L x might lie significantly below the upper limit. Finally, 
we apply our maximum-likelihood estimator, treating the upper 
limits correctly. We must make two Bayesian assumptions to ap- 
ply the method: (1) we assume that the probability distribution for 
the value underlying an upper limit is uniform in L x (rather than 
log L x ); (2) we allocate greater prior weight to models with small 
scatter by introducing a prior weighting P P rior(a) oc K This is ap- 
propriate to values bounded between and 00 and gives equal prob- 
ability per decade in a. Without this assumption (and since upper 
limits greatly outnumber detections), the likelihood estimator gives 
too much weight to models with very high scatter and very low nor- 
malisation. This results in the relation Lx = 10~°' 740 Lb °' 84 , with 
a scatter of 10°' 23 . The effects of the different fitting techniques 
are illustrated in Fig. 1171 A similar method for the B gc parameter 
results in the relation Lx~ W~ 5 ' 08 B gc 159 , again with a scat- 
ter of 10°' 23 , fitting both detections and limits with this Bayesian 
approach. 

1 11 1 

In addition, the richness measure No. 5 (Bahcall 1981) simi- 
lar to the Abell Richne ss Class, was st udied and found to give an 
unusably large scatter. lYee & L6pez-Cru3 il99St) also found this 
measure gave unacceptable scatter. 

Examples of outliers in these relations are now considered, 
to assess if clusters with similar richnesses do indeed exhibit very 
different X-ray luminosities. The candidates listed in Table [4] are 
chosen as obvious outliers in the Lx- Lb plot. For brevity, the 
candidates are referred to as A, B, C, D. For each of these, an 
I-band image with X-ray contours overlaid and a CMD with the 
fitted CMR overlaid is shown (Figs 121^241 . to verify the proper- 
ties which locate them in the Lx- Lb plot. Candidate A is one of 
the least X-ray luminous candidates, but is also not very optically 
rich. Candidate B is highly X-ray luminous and of moderately high 
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Figure 20. X-ray luminosity / Lb as a function of CMR-estimated redshift. 
Symbols as for Fig. 1181 



Table 4. Table of properties for interesting outliers from Lx- rela- 
tion. Letter preceding candidate ID is for brevity when discussing these 
case studies in the text. 



Candidate ID 


Lx 44 


[12 
L B 


Bg C 


Zest 


A 


-cmI072345.2+712742 


<0.06 


0.7 


500 


0.16 


B 


-cmJ162617.5+781706 


6.5 


3.9 


720 


0.55 


C 


-cmJ131148.5+322803 


25.3 


18.6 




0.70 


D 


-cmJ032903. 1+025640 


^0.4 


4.0 


1100 


0.37 



optical luminosity, laying just above the 3 a upper bound of the 
relation. Candidate C is the most X-ray luminous and optically lu- 
minous candidate; and candidate D has a comparable richness to 
candidate B, but is an order of magnitude less X-ray luminous. 

The two high Lx systems (B and C) indicated have very dif- 
ferent optical richnesses. Due to the volume probed and the rarity of 
such luminous clusters, these systems are expected to be at the high 
redshift end of the survey (the volume between 0.2<z<0.5 is simi- 
lar to that between 0.5<z<0.7) and, indeed, they are both found to 
lie in the range 0.5<z<0.7. Conversely, in order to be detected, the 
faintest systems must lie at low redshift (A). The fairly optically 
rich system with a low X-ray upper limit (D) lies intermediate in 
redshift to these extremes. 

From these figures <21I - I24> . the example cluster candidates 
appear to have the properties measured in the catalogues and shown 
in Table |4| For example, the X-ray detections do not appear con- 
taminated by point sources, and the redshift estimates seem entirely 
consistent with the predicted CMRs. 

Cases B and D have similar L b values, but their X-ray lumi- 
nosities vary by over an order of magnitude. Examining the relation 
using B gc instead of Lb shows that both these systems occupy 
similar regions of this plot. 

The scatter in the relation could be attributable to a number 
of factors. The physical processes involved in the determination of 
Lx have been discussed in but will be reiterated here, along 
with a discussion of Le- The X-ray luminosity is dependent on 
the temperature and density of the gas. These in turn depend on 
the dynamical state of the cluster (which determines the depth of 
the gravitational potential, and the densities that can be reached by 
the gas). The presence of a cooling flow increases the luminosity 
by increasing the gas density. Lx can also be increased through 
unresolved point source contamination. 
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Figure 17. Results of different fits to the Lj- Le relation. The data are shown in the upper left panel with detections as open circles and upper limits as 
points, for clarity. Overplotted lines show the different fits in the same line style as the contour plots. Contours show confidence limits for the fitted scatter 
and intercept (at Lg =10 12 Lq) - upper right panel: fitting only the detections; lower left panel: fit to upper limits and detections, using Bayesian approach 
described in text; lower right panel: all points fitted assuming upper limits are detections (this is essentially the same as assuming detections would lie just 
below the limits). Confidence levels run from 1-cr to 1-a in 1-<t intervals. 



The optical properties of the cluster candidates obviously de- 
pend on the properties of the member galaxies. Since these systems 
were selected on the presence of a CMR, a population of galaxies 
which formed their stars at high (z>2) redshift, and terminated star 
formation shortly after, is required. 

The interplay between the intracluster medium and the cluster 
galaxies is likely to be important and not straightforward to model. 
Several workers (e.g. IPonman et alll9 99: Bowe r~ all200lh have 
recently investigated such interplay using numerical simulations. 
They propose energy injection at early times from supernovae in 
cluster galaxies as a method to 'pre-heat' the ICM and produce ob- 
served relations such as the Lx- Tx relation. It is then quite likely 
that the scatter comes from variation in the X-ray luminosity rather 
than the mass to galaxy luminosity conversion. 



7 SPECTROSCOPIC OBSERVATIONS OF X-RAY DARK 
CLUSTER CANDIDATES 

In order to confirm the reality of candidate cluster systems from 
the XDCS, follow-up multi-object spectroscopy was undertaken 
with the MOSCA 3 instrument on the Calar Alto 3.5m telescope. 



MOSCA is a focal reducing spetrograph, installed at the Richey- 
Chretien Focus of the 3.5m telescope on Calar Alto. The reduction 
ratio of the optical system is 3.7, i.e. the effective focal ratio is f/2.7. 
This gives an image scale of 3 pixels per arcsec and a total FOV of 
11x11 arcmin. A thinned CCD with 2048x4096 15 micron pixels is 
used as the detector. The med-green grism was selected. This gives 
a wavelength coverage of 4300 - 8200A, with a central wavelength 
of 5500A and a dispersion of around 2.5A/ pixel and resolution of 
around 10A FWHM. This allows distinctive spectral features to be 
seen over a wide range of redshifts from z<0. 1 to z>0.6. 

Cluster candidat es with no ex tended X-ray counterpart, de- 
tected in the Vikhlin in et alJ ll998t) catalogue fields were chosen. 
The only other criterion applied was that the RA range available 
for the observing run meant that the candidates had to come from 
the subset of XDCS originally observed during the June 1998 run. 
This subsample comprises 16 fields, or approximately 4.5 square 
degrees. These targets are listed in Table|5| For each one, astrome- 
try was performed using the STARLINK program ASTROM to con- 
vert pixel coordinates into sky coordinates as measured from the 
APM catalogue 4 , to an rms accuracy of of ~0.3 arc seconds. Multi- 
object slit masks were constructed using a constant slit width of 1.5 
arcsec and a slit length of at least 10 arcsec. In order to be included 



3 |http : / /www. mpia-hd.mpg.de /MOSCA/ index .html| 



4 |http : / /www . ast . cam, ac . uk/~mike/apmcat/| 
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Figure 18. X-ray luminosity vs the Le richness measure. Filled points are >3<r X-ray detections; downward arrows are 3<r upper limits. Solid line is the best 
fit relation of the detections and limits using the Bayesian technique described in the text. Inset shows specific systems which are further discussed in text. 



Table 5. MOSCA mask centres. Note: the IDs are just numbered subfields 
of the RIXOS fields, and should not be confused with the similar nomen- 
clature used for X-ray candidates by the RIXOS collaboration. 



Candidate 


a (J2000) 


<5 (J2000) 


R110.1 


14 28 22.0 


+33 07 13 


R220.2 


17 23 37.9 


+74 43 17 


R236.1 


17 02 58.9 


+51 53 52 


R294.1 


23 19 54.5 


+12 32 27 



as potential slit candidates, galaxies had to be brighter than Ic=20 
(often a few galaxies fainter than this were allocated slits to fill the 
masks). 

7.1 Spectroscopic Observations and Data Reduction 

The spectra were secured over six nights of observations in July 
2000 using MOSCA on the Calar Alto 3.5m. A log of the observa- 
tions is presented in TableQ 

Before each night of observation, a series of bias frames (typ- 
ically five) was taken. For the purpose of wavelength calibration 
(WLC) two different comparison arcs were observed. This was 
done to ensure an adequate number of emission lines over the full- 
wavelength range covered by the MOSCA med-green grism. A 15s 



Table 7. Log of observations of cluster candidates from Calar Alto. 



Night 


Field 


Mask 


Exposure time (s) 


27 - 28/07/00 


R110.1 


1 


3 X 1800 


28 - 29/07/00 


R236.1 


1 


3 X 1800 


29 - 30/07/00 


R220.2 


2 


2 X 1800 


29 - 30/07/00 


R236.1 


2 


3 X 1800 


30 - 31/07/00 


R294.1 


1 


3 X 1800 


31/07 - 01/08/00 


R294.1 


2 


3 X 1800 



exposure of the HgAr/Ne arc was taken. This was augmented by a 
120s exposure of the Ar lamp using the 472/78 filter. The long ex- 
posure was used to make weak emission lines clearly visible, and 
the (BV-band) filter was used to suppress lines at the red end of 
the spectrum, which would otherwise become saturated. A combi- 
nation of the HgAr/Ne spectrum and lOOx the Ar spectrum was 
found to provide a good reference spectrum for WLC (hereafter, 
WLC refers to this composite arc spectrum image). Such calibra- 
tion frames were taken before twilight, at the start of each run. Flat- 
field frames were taken using MOSCA's internal tungsten lamp. 
For the science observations, three exposures were made of each 
mask, each of 30 minutes duration. After each series of science 
frames, whilst the telescope was still pointing at the object, another 
HgAr/Ne frame was taken (although it was not needed, see later). 
This was done in case flexure in the instrument due to the tele- 
scope's different pointing position affected the arcs taken earlier in 
the evening (while the telescope was parked, and therefore point- 
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Figure 19. X-ray luminosity vs the B gc richness measure. Filled points are >3<r X-ray detections; downward arrows are 3cr upper limits. Solid line is the 
best fit relation of the detections and limits using the Bayesian technique described in the text. Inset shows specific systems which are further discussed in text 
(note: a B gc value could not be computed for case study C). 
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Figure 21. Plots as for Fig. [HI for case study A (candidate cmJ072345. 2+712742). Solid line in CMD indicates model CMR for estimated redshift. 
Z CMR =016. This system is X-ray underluminous, and of reasonably low optical luminosity. 
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Figure 22. Plots as for Fig.lHlfor case study B (candidate cmJ162617. 5+781706). zqmr =0.55. This is a fairly optically luminous system with a very high 
X-ray luminosity for its richness. 
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Figure 23. Plots as for Fig.lHlfor case study C (candidate cmJ162617. 5+78 1706). zqmr =0.70. This is a high redshift, optically rich candidate with very 
high X-ray luminosity. 



ing at zenith). The Ar arc was not repeated during the night as its 
longer exposure time added an unacceptable overhead. 



apall. Wavelength calibration was carried out using the compos- 
ite arcs. No flux calibration was performed. 



7.2 Data Reduction 

Data reduction was carried out using standard I RAF routines. Mas- 
ter biases were created for each night by combining several bias 
images. Full 2D bias removal was necessary as the bias frames dis- 
played banded structure. Flatfielding was attempted but found to 
offer no improvement in the identification of spectral features and 
so was omitted. The three science exposures of each image were av- 
eraged to reject cosmic rays. The spectra were then extracted using 



7.3 Spectroscopic Redshift Determination 

The Fourier cross-correlation technique of iTonrv & Davisl 1 19791) 
was applied to the wavelength-calibrated spectra. This technique 
continuum-subtracts and Fourier transforms the galaxy spectrum 
and a reference template, applies high- and low-pass filtering, and 
looks for peaks in the cross-correlation function of the two. 

The template used was a de-redshifted E/S0 (as used by the 
CNOC collaboration, courtesy of E. Ellingson), hence no emission 
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Figure 24. Plots as for Fig. ll2l for case study D (candidate cmJ032903. 1+025640). tqmr =0.37. This is a candidate of similar optical richness to case B (at 
slightly lower redshift; 0.37 vs 0.55) but with an order of magnitude lower X-ray luminosity. 



Table 6. XDCS CMR candidates in MOSCA Fields. Dist indicates distance from field centre, t - (p) in estimated redshift indicates that candidate is flagged 
as a line-of-sight projection by the method described in 34. II X-ray fluxes and luminosities (using estimated redshifts) are 3<r upper limits for all candidates, 
none of which is detected in X-rays. 



Field 


XDCS ID 


Dist (arcmin) 


Zest * 


Fx 












(lO-^ergs- 1 era" 2 ) 


(lO^ergs- 1 ) 


R1101 


cmJ142812.0+330736 


2.1 


0.160 


^3.0 


^0.02 


R2202 


cmJ172333.0+744410 


1.0 


0.210 


^4.5 


^0.06 


R2361 


cmJ170244.2+5 15539 


2.9 


0.310 (p) 


<4.6 


^0.32 


R2361 


cmJ170232.6+514922 


6.1 


0.300 


<5.1 


^0.29 


R2361 


cmJ170258.9+514921 


4.5 


0.470 


<C3.4 


^0.90 


R2941 


cmJ231951.2+123208 


0.9 


0.370 (p) 


<2.0 


<0.08 



lines were used in the initial redshift determination (emission line 
objects are considered below). Each mask was run through f xcor, 
and the redshift from the highest cross-correlation peak logged. 
Each spectrum was then de-redshifted using the f xcor redshift. 
The de-redshifted spectrum then had the position of prominent ab- 
sorption lines (Ca H+K, G-band, H-beta and Mgb) and emission 
lines Oil and H/3 overplotted. The spectrum was then visually in- 
spected and a quality flag assigned to it, either: 2 - the redshift is 
confident; 1 - the redshift is less certain but looks compatible with 
the positions of the lines; or - no redshift is possible (usually due 
to too low S/N). 

Some spectra were also flagged for re-processing through 
f xcor, if the redshift was clearly wrong, and sufficient signal was 
present to get a better redshift estimate. The main reason for an 
incorrect redshift was the presence of large residuals from the sub- 
traction of bright night sky lines. Note that in the Fourier cross- 
correlation, the direction (i.e. absorption or emission) of lines is 
not taken into account; therefore, night sky residuals which approx- 
imate the positions of absorption features in the template can be 
confused. 

Absorption lines in the galaxy spectrum were logged, and if a 
possible emission line was present, the 2D spectrum was inspected 



to check if the emission was a genuine galaxy feature, or a residual 
sky line. 

For several spectra, for which the redshifts were readily appar- 
ent, and very strong emission was seen, the E/S0 template gave a 
poor redshift estimate. In this case an Sab/Scd template was substi- 
tuted and was found to give a much better fit. Emission line objects 
are clearly noted in the tables of results, below. 

Furthermore, once groupings in redshift space had been lo- 
cated ( £17.41 . all the spectra which failed to yield a redshift were 
re-examined to see if they were compatible with the redshift of any 
groupings. This yielded one extra redshift which had been missed 
previously. 

All redshifts were confirmed by visual inspection, by over- 
plotting the spectral features shown in Fig.|25|on the de-redshifted 
spectra. In order to be considered a confident redshift, two or more 
spectral lines had to be clearly visible and other features had to 
have some good reason for not being seen (e.g. strong sky residuals 
concealing a feature which should have been present). 
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Table 9. Galaxy Groupings in Redshift Space 



Field 


Galaxy 


Centroid" 




N T ot c 


median 


Az d 


N e ,. 

Aosp 




grouping 


a (J2000) 


5 (J2000) 






z 




R110.1 


la 


14 28 27.9 


33 05 24 


3 


8(13) 


0.196 


<0.001 


3 


R220.2 


2a 


17 23 25.1 


74 43 45 


7 


14(21) 


0.260 


0.003 


5 


R236.1 


3a 


17 02 59.2 


51 53 25 


6(7) 


24(30) 


0.297(0.297) 


0.007(0.007) 


3(4) 


R236.1 


3b 


17 02 52.4 


51 54 00 


11 


24(30) 


0.347 


0.004 


11 


R294.1 


4a 


23 20 00.2 


12 32 06 


3 


17(25) 


0.268 


0.001 


1 


R294.1 


4b 


23 19 48.0 


12 32 58 


3 


17(25) 


0.325 


0.003 


3 


R294.1 


4c 


23 19 55.6 


12 32 20 


5 


17(25) 


0.454 


0.010 


4 



a Centroid of members of grouping, using class 2 redshifts. 

b Number of galaxies in grouping - class 2 spectra (class 1 & 2 spectra). 

c Total number of class 2 (class 1 & 2) spectra in field. 

d Maximum redshift separation between a galaxy in the grouping and the median redshift of 
the grouping. 

e The number of absorption line only (i.e. no emission) galaxies in the grouping. 

Note: galaxies must be within 1500 kms in the rest-frame, at the median redshift, to be 
considered members of the grouping (see text). 



Table 8. Summary of Spectral Quality. 2 - secure redshift; 1 - less confident; 
- rejected. 



Field 


Number of Spectra 
Class Class 1 Class 2 


R110.1 


2 


5 


8 


R220.2 


14 


7 


14 


R236.1 


8 


6 


24 


R294.1 


7 


8 


17 


K H G-band 


H/S 


Mgb 






Figure 25. Representative de-redshifted spectra. From top to bottom: spec- 
tram with confident redshift; spectrum with confident redshift and emission 
lines; spectrum with less confident redshift. 



7.4 Significance of Clustering in Redshift Space 

Groupings in redshift space were extracted by searching for 3 or 
more secure redshifts separ ated by 1500 kms~ or less. This is 
the same method adopted by Holden et al. 1 1999) and corresponds 
to 3 x the typical cluster velocity dispersions they measured. It 
should be noted that a larger value was also tried, but 3<r clipping 
(described later) removed any extra galaxies added. The results of 
groupings found by this technique are illustrated in Fig. 1271 and 
analysed below. 

iRamella et ail <200Cl) and iHolden et al] \ 19994) used similar 
techniques to a ssess t he significance of clustering in redshift space. 
IRamella et alj l2000l) 's method is followed here. The selection 
function was calculated as follows. Fig. 1261 shows the number of 
galaxies for which redshift measurements were possible, as a func- 
tion of magnitude. Henceforth, only secure redshifts will be con- 
sidered. Note that all cluster members in the sample are class 2 (i.e. 
secure) redshifts, except one which is class 1. Of the 121 spectro- 
scopic targets, 61 resulted in secure redshifts, and a further 26 with 
less secure measurements. Two of these objects were stars. The 
majority of galaxies which fail to yield a redshift are fainter than 
I c = 20.0. 



As with the 2D data in £|3 . 1 .41 it is important to model the 
clustering of field galaxies when constructing mock galaxy dis- 
tributi ons. By u sing the Canada France Redshift Survey (CFRS, 
iLillv et alll995t) to construct a sim ulated field redshift distribution 
(as was done bv lHolden et alll999l) . and bootstrap resampling sets 
of galaxies, an estimate can be made of the fraction of spurious 
clustering detected in redshift space. Sets of 15 galaxies were ex- 
tracted - the mean number per field (2 masks) for which confident 
redshifts were secured. 10000 galaxy sets were generated, applying 
bootstrap resampling, and the fraction of sets containing a group- 
ing of more than 4 galaxies within 1500 kms -1 of their median 
redshift found. This occurs by chance ~6% of the time. For field 
Rl 10_1, only 3 galaxies were found within this velocity difference, 
but fewer than average redshifts (8) were obtained (due to only 
one, rather than two, masks being used). The velocity difference 
between these 3 is less than 1000 km s" 1 . This also occurs about 
6% of the time, and is therefore approximately as significant. If the 
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Figure 26. The I-band magnitude distribution of galaxies for which spec- 
troscopy was attempted. The empty histogram shows galaxies for which no 
redshift was determined; the black histogram shows galaxies with confi- 
dent redshifts; and the grey histogram shows galaxies with less confident 
redshifts. 




Figure 27. Large scale redshift distributions for cluster candidates: Rl 10_1 
(upper left), R220.2 (upper right), R236.1 (lower left), R294.1 (lower 
right). Line thickness indicates confidence in the redshifts. Bold lines are 
confident redshifts. The bin size is 0.004, which corresponds to a rest frame 
velocity of 1200 to 800 kms - 1 at the left and right sides of the plot, respec- 
tively. 

velocity difference is reduced to 1000 km s , the likelihood of 
finding 4 or more galaxies this close together in an observation of 
a 15 galaxy set is only 2%. These numbers are used as a guideline 
to the significance o f grou pings in redshift space. 

Ramella et al. 1 2000 ) take this technique further by trying to 
reproduce more accurately the magnitude selection function. To do 
this they take the histogram of magnitudes for which spectra were 
obtained and divide this by the total number of galaxies in the same 
area in the same magnitude bin (i.e. the histogram shown in Fig. 1261 
is divided by the field galaxy number counts - Fig. [2]- the result is 
shown in Fig. l28> , 

If the luminosity function is universal and the local normali- 
sation is the same everywhere (i.e. no clustering), then the redshift 
distribution is given by: 



.m, -, -, -, ^ 

17 18 19 20 21 

Figure 28. The magnitude selection function, s(m), for our spectroscopic 
sample. See text for details. 



N ^ = ^~chT I * f[L(m,z)]s{m)dm (10) 

jRamella et alfeOOOh where dV is the volume element, m,\,rai are 
the magnitude limits and / is the Schechter LF given in £13.1.21 
Applying the selectio n function, s(m) give s the redshift distribu- 
tion shown in Fig. 1291 Ikamella et alJ ^200(3) note that using such a 
distribution gives a false impression of the significance of group- 
ings, as the clustering in redshift space must be accounted for. In 
order to do this, the CFRS is again bootstrap resamp led, but this 
time u sing the magnitude selection function s(m). In Ramella et al. 
(2000)'s method, they compare the N(z) distribution of their data 
with that of the CFRS using their selection function, and state that 
a Kolmogorov-Smirnov test shows the two datasets have similar 
distributions. Using the XDCS spectroscopic sample, however, a 
KS test shows the bootstrapped CFRS N(z) and the N(z) in Fig.l29l 
are significantly different at the >90% level. It appears that this is 
because a larger fracti on of the redshifts in the XDCS sample are 
cluster members. The Ramella et al. 1 2000) sample targeted higher 
redshift candidates, and so recovered a lower fraction of cluster 
members than the data presented here (most of the XDCS cluster 
candid ates lie in the range 0.2<z<0.4). Therefore Ramella et alJ 
(2000)'s data follow the CFRS N(z) as most of their data are field 
galaxies (so it is correct when they state that magnitude selection is 
the main process leading to the inclusion of galaxies in the sample). 
In the XDCS sample, however, significant grouping s of 6 or more 
redshi fts (more than the individual groupings in the Ramella et alJ 
(2000) data) are present, and thus the total (cluster + field) sample 
is not represented by the field survey of the CFRS. This difference 
in N(z)'s provides reassurance that significant clusters have been 
found. 

Using Gaussian s(m) leads to different probabilities of 
false detections. For n req =4, P(false)=0. 19; and for n re q=5, 
P(false)=0.03. This illustrates that magnitude selection has a big 
effect on the significance assigned. 

To summarise these tests: the CFRS has been used to simu- 
late the redshift distribution of field galaxies. Two different mag- 
nitude selection functions have been used to sample this survey. 
Bootstrap resampling of the data is used to calculate the probabil- 
ity, P(false), of incorrectly identifying a grouping of n rcq galax- 
ies in redshift space - the galaxies being selected in the same way 
as for the MOSCA targets. For the simplest selection function (a 
step function in magnitude, selecting galaxies brighter than mu m ), 
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Figure 29. The redshift distribution function for our spectroscopic sample. 
The dashed line shows the N(z) described by Equation llOl 

P(false)~0.06 for n req =4. For the best-fit Gaussian magnitude se- 
lection function, P(false)~0. 19 for n req —4, and P(false)~0.02 for 

Tlreq — 5. 

These are the most recent techniques used in the literature for 
estimating the significance of redshift groupings found from opti- 
cal cluster surveys. However, there are several problems with them. 
Firstly, using the CFRS to model the field does not take any account 
of the presence of groups within it. Thus, for finding the lowest ve- 
locity dispersion systems, which will be numerous in any wide-field 
survey (and therefore the CFRS), these methods underestimate the 
significance of systems found. Secondly, not all the available in- 
formation is used. The most basic spectral properties (i.e. whether 
or not the cluster members possess emission lines) and the colours 
can be used to infer the types of galaxies in the sample. Absorp- 
tion line systems with red colours at the given redshift are highly 
indicative that a galaxy is of early-type. Since these systems domi- 
nate the cores of known clusters, but are much less common in the 
field, their presence increases the likelihood of a cluster. This tech- 
nique, however, would be biased against systems not containing 
early-type galaxies. 

It should be noted that using a mean N(z) from many fields 
would tend to overestimate the significance of clustering found in 
any one pencil beam survey, especially if galaxies lie in sheets 
along the line of sight. 

By taking the field gala xy luminosity function from the 
CNOC2 survey <Lin etalll 999). the expected number of early-type 
galaxies in a given volume can be calculated. iLin et ail ^999)'s 
parameters for the Rc-band luminosity function in the qo=0.1 cos- 
mology are taken. This is a Schechter LF with two additional pa- 
rameters to model the evolution in luminosity and densitv. lLin et alJ 
1 1999) state that to convert their LFs to another band, a good ap- 
proximation is to just apply an appropriate offset in M* based on 
the mean rest-frame colour for that galaxy type. Thus, to calcu- 
late the LF at z=0.3 fo r early-type galaxies, a res t-frame colour 
of Rc-Ic=0.71 is used l Kodam a & Arimotdl 19971) to correct M* 
to the Ic-band, along with a correction for the different value of 
h (+5\ogh). The value of <£>* is taken from the R c value, only 
correcting for the h difference, and applying the evolutionary pa- 
rameters P and Q (Table 2. llin et alll999l) to correct to a redshift 
of 0.3. The difference in $* over the range of interest for the ex- 
treme XDCS spectroscopic candidates (i.e. 0.2 < z < 0.5) is less 
than a factor of 2, and so, for simplicity, a fiducial redshift of 0.3 is 



used here. This results in a space density of early-type galaxies in 
the field at redshift 0.3 of 0.04 Mpc" 3 . 

The number of galaxies in each redshift grouping showing 
only absorption features is given in Table |5| Since these systems 
do not show emission lines and have colours consistent with early- 
type galaxies at the cluster redshift (Table|9j, these are taken to be 
early-type galaxies. A generous estimate of the volume from which 
each of these redshift groupings is drawn is to take an angular size 
of 5 arcmins (approximately the maximum separation on the sky 
between galaxies in the same redshift grouping) at a redshift 0.3, 
and to assume the volume is a sphere of this radius (this is about the 
same size as given by a line-of-sight velocity difference of a 1000 
km s _1 , again a generous value for these groupings). This translates 
into a volume of ~ 8 h _1 Mpc 3 . Since the space density of early- 
type galaxies in the field at this redshift is of 0.04 h~ 1 Mpc 3 , the ex- 
pected number in such a volume is 0.32. Assuming the field can be 
modelled by a Poisson distribution with this expectation value, the 
likelihood of finding 1 1 early-type galaxies (the maximum found - 
candidate 3b) is only ~lxl0~ 8 . The likelihood of finding 1 (the 
minimum - candidate 4a) is 25%; and the likelihood of finding 3 
(the minimum of all the remaining candidates) is 0.3%. 

This argument is an over-simplification as firstly it assumes 
the field can be modelled as a Poisson distribution, which is not 
strictly correct because of clustering, but not a bad approximation 
(it just raises the expectation value slightly). Secondly, the candi- 
dates were selected to be overdense in galaxies of the same colour. 
This means the fields selected were not typical regions of space. 
However, selecting galaxies on the CMR was not guaranteed to se- 
lect early-type galaxies at the same redshift, but this is the result 
which was found from the spectroscopy. Therefore, the method is 
still valid. Thus, by this simple argument, it seems reasonable to 
assume that all the groups containing at least 3 early-type galaxies 
are significant. This method therefore rejects grouping 4a. In sum- 
mary, all the redshift space groupings are found to be significant 
by this technique, except candidate 4a which contains 3 galaxies, 
but only 1 of which is early-type (i.e. red and emission-free). This 
candidate is found to be significant by the previous two methods 
which do not take colour/ type into account. Clusters of this nature 
(i.e. not dominated by early-type galaxies) have not been observed 
before, so this system must be treated cautiously. 

7.5 Comparison of Significant Redshift Groupings with 
Cluster Candidates 

Now that the groupings in redshift space have been identified and 
their significances assessed, the final step is to compare these with 
the candidates detected with the cluster-finding algorithm. Firstly, 
a simple comparison will be made by just finding the nearest candi- 
date in the catalogue (Table|6j with the centroids of the groupings 
found with MOSCA (Table|9j. These are tabulated below for the 
CMR algorithm (TablefTol. 

For the CMR-finder, the offset between measured and esti- 
mated redshift for the two isolated groups (Rl 10_1 and R220_2) is 
Az^0.05 for both. For the multiple systems, although only one es- 
timated redshift is given in the table for each field (for the most sig- 
nificant candidate), these candidates were flagged as line-of-sight- 
projections in Table |6| The estimated redshift of the most signif- 
icant CMR candidate is always intermediate between the spectro- 
scopic redshifts, and always ^0.1. The full catalogues may be ex- 
amined for these projected systems, to see how well these agree 
with the spectroscopically determined groups. 

The full CMR catalogues in the region of the R236.1 and 
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Table 10. Nearest CMR candidate to each MOSCA group. 

a Separation between the centroid of the spectroscopic grouping and the 

nearest MF candidate in arc minutes and physical distance (Mpc) at z spec . 

b Spectroscopic redshift of the former grouping. 

c CMR estimated redshift 

d Difference between these two redshifts. 



Field 


ID 


CMR Candidate 
ID 


Separation 12 
(arcmin/ Mpc) 


z b 


z esi 




R110 


la 


cmJ 1428 12.0+330736 


3.995/0.81 


0.196 


0.160 


0.036 


R220 


2a 


cmJ 172333.0+7444 10 


0.670/0.17 


0.260 


0.210 


0.050 


R236 


3a 


cmJ170244.2+515539 


3.228/0.88 


0.297 


0.310 


0.013 


R236 


3b 


cmJ170244.2+515539 


2.092/0.63 


0.347 


0.310 


0.037 


R294 


4a 


cmJ231951.2+123208 


2.187/0.56 


0.268 


0.370 


0.102 


R294 


4b 


cmJ231951.2+123208 


1.147/0.33 


0.325 


0.370 


0.045 


R294 


4c 


cmJ231951.2+123208 


1.082/0.38 


0.454 


0.370 


0.084 



Table 11. Groups from the full CMR-catalogue for those systems flagged 
as "projections". <r is the significance from the CMR algorithm. 



Field 


CMR candidate ID 


Zest 


17 


R236A 


cmJ170240.9+515512 


0.270 


5.15 


R236A 


cmJ170242.7+515222 


0.470 


4.35 


R236A 


cmJ170244.2+515539 


0.310 


5.55 


R236B 


cmJ170248.5+515051 


0.390 


4.55 


R236B 


cmJ170250.6+515506 


0.300 


6.85 


R236B 


cmJ170252.1+515717 


0.490 


4.85 


R294B 


cmJ231945.8+123304 


0.270 


4.85 


R294B 


cmJ231951.2+123208 


0.370 


5.05 



R294_l fields are given in Table ITT1 The candidates are split be- 
tween A and B rotation results for R236_l , as here the two rotations 
overlap 5 . This is not the case for R294.1. 

A cross-correlation analysis between the position of each 
CMR candidate and each MOSCA candidate (position given by 
centroid of redshifts), and their respective redshifts, was performed. 
In this way, the closest match in projected and redshift space was 
located. Fig. 1301 shows this data projected into 2D along a line of 
constant declination (so the R.A. offset gives the approximate sky- 
plane offset). It can be seen that, for R236_l, both the A and B cat- 
alogues identify 3 candidates at approximately the same redshifts: 
two close to the MOSCA groups and one at slightly higher redshift. 
This illustrates that agreement between the A and B redshift esti- 
mates is good, and the agreement with the spectroscopic redshifts 
is also good (^0.05). The possibility of a higher redshift candidate, 
not reached by the depth of the MOSCA spectroscopy, is likely, 
given that it is identified independently in both rotations, and be- 
cause the lower redshift groups agree so well with the spectroscopy. 
For R294_l, two candidates are found. Given that the lowest red- 
shift of the three MOSCA groups in this field (4a) is not significant 
from the space density of early-type galaxies analysis (and will not 
be found by the CMR algorithm, because it does not contain signif- 
icant numbers of early-type galaxies), the most likely interpretation 
is that the CMR-finder detects the two highest redshift groups and 
underestimates the redshifts of both (albeit by only ^0.08). Thus, 
the candidates from the full catalogue are naturally associated with 
the nearest groups in 2D space, and these are then found to be also 
the nearest in redshift space. Therefore, the CMR finder performs 
excellently, correctly finding and separating all the systems identi- 
fied spectroscopically. 

5 R110.1 also has overlapping A and B rotation, although the V-band A- 
rotation data is slightly trailed and so rejected from this analysis. 




AR.A. (arcmin) AR.A (arcrrin) 



Figure 30. R.A. vs z (declination slice) plot for MOSCA (spectroscopically 
determined) groups and CMR candidates R236.1 (left) and R294.1 (right). 
Filled circles are MOSCA groups (note: the lowest-z point in the R294.1 is 
not significant in terms of early-type galaxies); open circles are CMR can- 
didates from A- and B-rotation data (as labelled). Broken horizontal error 
bars denote optical radii of candidates (typically ~1 arcmin): solid lines 
connect optical candidates to nearest spectroscopically confirmed group; 
solid vertical error bars denote width of redshift slice (i.e. CMR estimated 
redshift error, c.f. Fig. 7). For R236.1, two candidates in each rotation are 
seen near the spectroscopically determined group. Also, a higher redshift 
candidate is seen in both rotations. The spectroscopy may not probe deep 
enough to have found members of this group. In R294.1 two candidates are 
found within Az^,0.08 of the most significant (i.e. the two highest redshift) 
spectroscopic groups. 

7.6 Cluster Velocity Dispersion Estimates 

The cluster redshif t and velocity dispersion were calculated fol- 
lowing Bee rs et alJ ll99(l) . They recommend using the median and 
standard deviation when dealing with tiny (n-~5) datasets. Only the 
secure redshifts were considered. Redshifts within ~2000 km s _1 
of the peak in the redshift histogram were extracted and the median 
value was taken to be the cluster redshift. The standard deviation 
was computed, and any value exceeding 3 standard deviations from 
the median excluded (this was only the case for the clusters in field 
R236_l: one value was rejected from each), and the standard devi- 
ation then re-computed. This was then transformed to the velocity 
dispersion in the cluster's rest-frame. The confidence interval for 
the velocity dispersion was found by applying the statistical jack- 
knife technique to the data (for example lCarlberg et alll99fil) . This 
simple resampling technique uses 'pseudo-values' <5; of the data, 
by calculating the difference between a statistical measure, /, cal- 
culated for the whole dataset, and for the dataset with one value 
removed Si = f(xi, x n ) - f(xi, .... Xj, x„). Th e es- 

timate of the variance is [n/(n - 1) £\ 5?] 1/2 (;Efr onlll982h . For 
very small N (~ 3) this error estimate is likely to be highly bi- 
ased as only two data points are being resampled each time and the 
factor of y/n/(n — 1) is likely to be an underestimate. These val- 
ues must be treated cautiously for the three groupings with only 3 
galaxies. 

As described in j4lS mailetail tl998) plotted the Lb measure 
against the X-ray temperature of the most X-ray luminous clusters 
at z~0.2, and found a good correlation. This suggests that Lb may 
be a good tracer of cluster mass. Using the relation between veloc- 
ity dispe rsion and X-ray tem perature of W u et alj ll999h to trans- 
form the Smail et al. 1 1998) data yields a power law fit which is 
consistent with our data, although given the large uncertainties on 
our velocity dispersions, we cannot constrain this relation. We sim- 
ply note that given our limited data, we cannot tell if the relation 
between L_b and a for X-ray luminous clusters still holds for X-ray 
underluminous systems. 

Spectroscopic observations are underway for several of the 
XDCS clusters as part of other projects, and so these should allow 
much more accurate estimates of the velocity dispersions, further 
investigation of this relation, and the use of Lb as a mass estimator. 



An Optical Survey for X-ray Dark Clusters 29 



Table 12. Cluster velocity dispersion estimates. Columns are: ID of redshift 
grouping; Field ID; Number of galaxies (N) used in redshift determination; 
velocity dispersion (<r z ); velocity dispersion in cluster rest-frame; and error 
on this quantity from jack-knife estimate (see text). This error estimate is 
likely to be biased in the presence of very small N (i.e. ~3), and so such 
error estimates should be treated cautiously. Most of the more reliable sys- 
tems show velocity dispersions in the range 300 kms -1 — 700 kms -1 , 
typical of massive groups and low - intermediate mass clusters. 



Gnnipin;j 
ID 


Field 


N 






CT r-est 
(kms^ 1 ) 


(kms^ 1 ) 


la 


R110.1 


3 


0.196440 


0.00031 


78 


95 


2a 


R22CU 


7 


0.259740 


0.00143 


341 


346 


3a 


R236J 


6 


0.297210 


0.00315 


728 


504 


3b 


R236J 


11 


0.347100 


0.00179 


398 


262 


4a 


R294J 


3 


0.268450 


0.00259 


612 


750 


4b 


R294J 


3 


0.325470 


0.00224 


506 


620 


4c 


R294J 


5 


0.453810 


0.00467 


962 


1051 



7.7 Summary of Spectroscopic Results 

Spectroscopy has been undertaken for four XDCS subfields, con- 
taining cluster candidates not showing significant X-ray emission. 
Candidate groups in redshift space were identified, and the signifi- 
cance of these groups evaluated by three different techniques. The 
first two involved bootstrap resampling the Canada-France Red- 
shift Survey using different selection functions. Using a simple 
magnitude limited selection showed (in general agreement with 
Holde n et alJ ll99 9)'s method) that 3 concurrent redshifts was a 
significant grouping; using a Gaussian magnitude selection greatly 
reduced the significance of these groups (showing the technique is 
very sensitive to the simulated selection function), but the draw- 
backs of both these techniques were discussed. An argument based 
on the space-density of early-type galaxies showed that three early- 
type galaxies constituted a robust group. Using this latter argument, 
one group was detected in each of two of the fields, and two groups 
were detected in the other two fields. The colour-magnitude finder 
correctly separated line of sight projected systems and also detected 
a higher redshift system, not revealed by the spectroscopy (most 
likely as galaxies sufficiently faint were not targeted). The CMR 
redshift error is around 0.04, for groups in the redshift range 0.2 - 
0.4. Velocity dispersions for most of these systems are around 300 
- 700 kms -1 (corresponding to massive groups and low - inter- 
mediate mass clusters) but these are estimated from tiny numbers 
of galaxies (~5), and have jack-knife estimated errors of around 
60 - 100% (and for the systems with only 3 redshifts, these errors 
are likely to be underestimated). Finally, the luminosity in early- 
type galaxies versus the velocity dispersion was compared with 
the relation taken for the high X-ray luminosity cluster sample of 
ISmail etall 1 19981) . and seen to be consistent, although the errors 
on the XDCS velocity dispersions are very large. 



8 DISCUSSION, CONCLUSIONS & FUTURE WORK 

The original mot ivati on for this work was the studies of 
iBower et ail ll994h and lBower et alJ ll997l) . which found that in 
an optically selected survey of galaxy clusters at z~0.4, the X- 
ray emission was systematically lower than expected for a non- 
evolving X-ray luminosity function, relative to local samples. Their 
spectroscopic analysis indicated that these systems had velocity 
dispersions comparable to those of more X-ray luminous systems, 
which suggested that if the clusters were virialised then they had 
dynamical masses similar to the more X-ray luminous/ massive 



systems; or that the systems were in fact unrelaxed and their veloc- 
ity dispersions were thus inflated above that of a relaxed system. 

This work has constructed similar optically selected samples, 
albeit from a smaller area (11 deg 2 versus 27 deg 2 ) but with a more 
quantifiable selection function and using more efficient selection 
techniques. The relationship between X-ray luminosity and rich- 
ness (as measured three different ways) shows considerable scatter. 

During the c ourse o f this work, results from a similar study 
bv lDonahue et alJ 1200 lh were published. They conducted an op- 
tical and X-ray surv ey in 23 deep ROSAT fields (4.8 deg 2 ) using 
IPostman et alll993 's Matched Filter algorithm on I-band data. The 
depth of their photometry was about 0.5 magnitudes deeper than 
that of the XDCS, although their are al cove rage was lower by more 
than a factor of two. lDonahue et all 1200 lh detected 57 X-ray can- 
didate clusters and 152 candidates in the optical. Their MF algo- 
rithm detected 74% (26 out of 35) of the most reliable X-ray can- 
didates. This number is in good agreement with the 75% (9 out of 
12) found with the MF algorithm used here. We have shown that an 
even higher recovery rate is possible using CMR techniques (poten- 
tially 10 ou t of 10) and with much more reliable redshift estimates. 

As in [Donahue et alJ 1200 ll) . we find that within their opti- 
cally selected sample, optical and X-ray luminosity are correlated, 
with considerable scatter. Their measure of richness is essentially 
the number of L* galaxies (A in equation |5J contributing to the 
cluster signal at their MF estimated redshift. We show that the MF 
estimated redshifts are much poorer than those estimated from the 
CMR finder. This will potentially increase the scatter of the rela- 
tion. They state that although there is significant scatter within the 
relation, there is no need to impose a bimodal distribution of X- 
ray luminous and X-ray faint clusters. This seems to be borne out 
by this work, as the distribution of detections in Fig. 1181 appears 
continuous. 

We find a scatter of 0.2 dex (a factor of 1.6) in the relation. 
Clearly this is important if all the systems (both X-ray dark and 
bright) are needed for cosmological models. Our X-ray dark clus- 
ters are certainly convincing and in S0we show that they are con- 
firmed by spectroscopy. How we deal with the scatter in cosmolog- 
ical surveys depends on its origin. We consider this below. 

Possible reasons for this scatter include: 

1) Variations in the efficiency of galaxy formation. If galaxy 
formation is more efficient at a given epoch/ environment, then for 
a given mass of gas, a higher fraction can be converted to stars, in- 
creasing the light to mass ratio of a cluster. Furthermore, this leaves 
less gas available for production of X-ray emission, decreasing the 
X-ray luminosity. So, higher galaxy formation efficiency leads to 
increased optical luminosity and decreased X-ray luminosity. This, 
h owever, is not seen in semi-analytic galaxy formation models such 
as lCole etafll200d) . 

2) The dynamical state of the cluster. As mentioned before, if 
a cluster is dynamically unrelaxed then the hot intracluster gas will 
not be centrally concentrated to densities sufficient for X-ray emis- 
sion (SQ. If the cluster galaxies are already in place (as seems to be 
suggested by Stanford et al. 2001) then such a cluster would have 
an unusually low X-ray luminosity for its optical luminosity. High 
resolution observations with Chandra have shown cluster cores are 
far from relaxed l Ma zzotta et all200ll) but what we require here is 
an even more widespread distribution of X-ray properties. 

3) Thermal history of the gas. The presence of cooling gas 
in the cluster raises the ICM density and initially increases X- 
ray luminosity. Conversely, injecting energy into the ICM at early 
times [e.g. by AGN or through s upernovae/ feedback f rom galaxy 
formation l Pon man et all 1 19991: Iwu et alJll998t IVoit et alJl200a 
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Bow er et al I2001M decreases the ICM density and lowers X-ray lu- 
minosity. Both these effects could contribute to scatter in the optical 
- X-ray luminosity relation. The scatter ma y also reflect dif ferent 
levels of preheating from cluster to cluster I M ushotzkv & S chard 
Il997l> . 

4) Projection effects. Groups of galaxies projected along the 
line of sight would appear as higher optical luminosity clusters 
(since the number of galaxies observed is simply additive); whereas 
the X-ray luminosity would appear extremely low for a cluster of 
such optical richness, as the X-ray luminosity scales as the square 
of the gas density. This was shown to probably not be a significant 
factor in J6| by separately considering optical cluster candidates 
flagged as projections. Although, again, the volume probed by this 
survey is relatively small, so large scale filaments viewed 'end-on' 
may be too rare to be included. 

These mechanisms all assume that the fundamental parameter 
is the cluster mass. The best measurement for the cluster mass in 
this paper is the velocity dispersion. This suggested that within the 
(large) errors, a sample of optically selected, X-ray underluminous 
clusters had optical luminosities consistent with those of the most 
X-ray luminous clusters. Clearly, better mass estimates are required 
for a larger number of clusters. Recently. I Yee & Ellingsori 12003) 
have examined the CNOC 1 sample of X-ray luminous clusters, and 
found that B gc , Tx and Lx can be used to infer the dynamical 
mass of these systems to within 30 %. Other possibile m ass esti- 
mators include gravitational lensing 1 Hoek stra et all2002T) or total 
K-band galaxy luminosity llin et all2003l) . 
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